{"id":12,"date":"2024-10-21T18:00:39","date_gmt":"2024-10-21T18:00:39","guid":{"rendered":"https:\/\/www.w3.org\/community\/htmlvoc\/?p=12"},"modified":"2024-10-21T18:00:39","modified_gmt":"2024-10-21T18:00:39","slug":"update-on-html-vocabulary-draft-report","status":"publish","type":"post","link":"https:\/\/www.w3.org\/community\/htmlvoc\/2024\/10\/21\/update-on-html-vocabulary-draft-report\/","title":{"rendered":"Update on HTML vocabulary: draft report"},"content":{"rendered":"\n<p>Hi all! <br><br>Exciting news on our end: a first version of a draft report for the community group regarding the HTML vocabulary is now available on <a href=\"https:\/\/github.com\/floresbakker\/htmlvoc\/blob\/main\/Specification\/HTML%20vocabulary.html\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/htmlvoc\/blob\/main\/Specification\/HTML%20vocabulary.html\" target=\"_blank\" rel=\"nofollow\">GitHub<\/a>. This specifies the HTML vocabulary in a readable format, using the ReSpec standard. 
Here one can see a preview of the draft report for the W3C community group:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"490\" src=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1-1024x490.jpg\" alt=\"\" class=\"wp-image-13\" srcset=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1-1024x490.jpg 1024w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1-300x143.jpg 300w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1-768x367.jpg 768w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1-1536x735.jpg 1536w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary1.jpg 1903w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>In this document, one can read the specification of classes, properties, shapes and the like to get an understanding of the HTML vocabulary.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"515\" src=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2-1024x515.jpg\" alt=\"\" class=\"wp-image-14\" srcset=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2-1024x515.jpg 1024w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2-300x151.jpg 300w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2-768x386.jpg 768w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2-1536x772.jpg 1536w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/HTML-vocabulary2.jpg 1910w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>The document was actually generated using <a href=\"https:\/\/github.com\/floresbakker\/OntoReSpec\" data-type=\"link\" 
data-id=\"https:\/\/github.com\/floresbakker\/OntoReSpec\" target=\"_blank\" rel=\"nofollow\">OntoReSpec<\/a>, an open-source tool based on semantic web technology. Here comes the twist: OntoReSpec uses the HTML vocabulary to generate the very HTML document we are looking at. The HTML vocabulary proves that it works by generating its own specification.<br><br>The HTML vocabulary has changed somewhat since the last time we sent out an update. This is not so much a change of course as a fine-tuning, to align as closely as possible with the HTML Living Standard itself.<br><br>Consider, for example, the algorithm to serialize HTML fragments, modeled through the SHACL node shape <em>shp:HTMLFragmentSerializationAlgorithm<\/em>. This is the engine of the vocabulary that does all the work.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"513\" src=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1024x513.png\" alt=\"\" class=\"wp-image-16\" srcset=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1024x513.png 1024w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-300x150.png 300w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-768x385.png 768w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1536x769.png 1536w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image.png 1893w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>It now calls several SPARQL functions in order to serialize an HTML document from its leaf nodes up to the root element and the document containing it. This allowed us to remove four unnecessary node shapes and improve the efficiency and readability of the vocabulary. 
<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"518\" src=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1-1024x518.png\" alt=\"\" class=\"wp-image-17\" srcset=\"https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1-1024x518.png 1024w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1-300x152.png 300w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1-768x389.png 768w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1-1536x777.png 1536w, https:\/\/www.w3.org\/community\/htmlvoc\/files\/2024\/10\/image-1.png 1891w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>The current version of the HTML vocabulary will now undergo acceptance testing within the Dutch Ministry of Finance; in addition, it has already been proven to work in an OntoReSpec implementation, as mentioned above. <\/p>\n\n\n\n<p>The concepts and methodology behind the HTML vocabulary, essentially a form of abstract syntax tree, are generic and can be applied outside the HTML domain. We came up with an <a href=\"https:\/\/github.com\/floresbakker\/xmlvoc\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/xmlvoc\" target=\"_blank\" rel=\"nofollow\">XML vocabulary<\/a>, with which XML dialects can be modeled. See, for instance, the draft versions of <a href=\"https:\/\/github.com\/floresbakker\/OntoSVG\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/OntoSVG\" target=\"_blank\" rel=\"nofollow\">OntoSVG<\/a> and <a href=\"https:\/\/github.com\/floresbakker\/OntoArchimate\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/OntoArchimate\" target=\"_blank\" rel=\"nofollow\">OntoArchimate<\/a>. 
Going even further, we can now also apply this to domains such as <a href=\"https:\/\/github.com\/floresbakker\/englishvoc\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/englishvoc\" target=\"_blank\" rel=\"nofollow\">English<\/a>, <a href=\"https:\/\/github.com\/floresbakker\/pythonvoc\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/pythonvoc\" target=\"_blank\" rel=\"nofollow\">Python<\/a> and <a href=\"https:\/\/github.com\/floresbakker\/sparqlvoc\" data-type=\"link\" data-id=\"https:\/\/github.com\/floresbakker\/sparqlvoc\" target=\"_blank\" rel=\"nofollow\">SPARQL<\/a> itself. For now these are just inspiring attempts, but they can be developed more seriously in the future, just as we did with the HTML vocabulary.<br><br>Are we done yet with the HTML vocabulary? No. There are still some issues, although minor ones. We still want to fine-tune the documentation and enrich some definitions, perhaps improving OntoReSpec along the way as well. The ReSpec document currently contains two oddities: (1) An unnecessary warning about 10 <em>duplicate definitions<\/em>, caused by classes and properties whose names are identical except for a capital letter. Think of html:abbr (property) versus html:Abbr (class). ReSpec does not like this. I have raised an issue on the ReSpec GitHub, as I do not know how to handle this in ReSpec. (2) The &#8220;<em>Latest editor&#8217;s draft:<\/em>&#8221; link in the document points to a nonexistent GitHub page whose origin I do not know. I have added this to another issue on the ReSpec GitHub. Finally, and most importantly, we wish to add a rudimentary validation model of the structure of an HTML document. <br><br>All in all, it looks like we are going to deliver our community group draft report by the end of the year. 
We can organize a meeting around that time to see what we as a group still want to do with the specification, and how and when we can move forward to a formal working group trajectory.<\/p>\n\n\n\n<p>We&#8217;ll keep you updated. In the meantime, feel free to read the specification, both the HTML document and the Turtle file, and let us know what you think. Are you satisfied with the current state of the HTML vocabulary? Do you have specific wishes to improve the ReSpec documentation? Critical complaints and constructive criticisms are welcome, as are charming compliments \ud83d\ude42<\/p>\n\n\n\n<p>Kind regards,<\/p>\n\n\n\n<p><em>Flores Bakker<br>Chair of the HTML vocabulary community group <br>Enterprise Architect @ Dutch Ministry of Finance<\/em><\/p>\n\n\n\n<p><em>Wouter Beek<br>Co-founder @ Triply<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hi all! Exciting news on our end: a first version of a draft report for the community group regarding the HTML vocabulary is now available on GitHub. 
This specifies the HTML vocabulary in a readable format, using the ReSpec &hellip; <a href=\"https:\/\/www.w3.org\/community\/htmlvoc\/2024\/10\/21\/update-on-html-vocabulary-draft-report\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":21511,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_s2mail":"","footnotes":""},"categories":[1],"tags":[6,3,7,5],"class_list":["post-12","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-draft","tag-html","tag-rdf","tag-report"],"_links":{"self":[{"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/posts\/12","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/users\/21511"}],"replies":[{"embeddable":true,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/comments?post=12"}],"version-history":[{"count":16,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/posts\/12\/revisions"}],"predecessor-version":[{"id":32,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/posts\/12\/revisions\/32"}],"wp:attachment":[{"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/media?parent=12"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/categories?post=12"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.w3.org\/community\/htmlvoc\/wp-json\/wp\/v2\/tags?post=12"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}