Meeting minutes
preamble
FranckCotton: welcome everybody to the first meeting on the newly chartered Dataset Exchange Working Group
… the group was rechartered following discussions between W3C and the DDI Alliance
… https://
… The DDI Alliance published data-related specifications
… the idea was to bring together expertise of both, to enrich DCAT for different kinds of datasets
… our main deliverable is the Vocabulary Variable Description (tentatively called VVD, pronounced "vivid")
… other tentative deliverables, including Profile-related specifications and Data Quality Vocabulary
… we might want to setup task forces, we will discuss this in due time
… Some initial work on VVD was done during the DDI seminar in Dagstuhl, last year
… for this purpose a Community Group was created
<antoine> https://
<antoine> Use cases draft: https://
<antoine> VVD draft: https://
introductions
FranckCotton: let everyone introduce themselves, describing where they come from and what their expectations from this group are
… I start: I worked at INSEE, the French Institute for Statistics, with different hat, lately in statistical metadata
<Dan> Best if you call on people
FranckCotton: Some of you may know me as one of the organizers of the Semantics Statistics workshop at ISWC; others may know me from Dagstuhl seminars
… I'm completely new to W3C, bear with me :)
… My expectation is for the group to deliver what's on our charter
amerono: Albert Menono, Kings College London
… I have done work on related W3C standards: Data Cube, CSVW, Linked Data Notifications
… I was also involved in the DDI Alliance Dagstuhl workshop that bootstrapped this work
… I sit at the intersection between W3C, DDI Alliance, and also Croissant
… https://
… I think out work here can have an impact on Croissant
… I recently got a grant on "Data for Social"; goal is to make datasets more transparents and "AI ready"
antoine: Antoine Isaac, great to see you all; I'm affiliated with the Free University in Amsterdam,
… main work on Europeana, in the Digital Heritage domain
… noth organizations are based in the Netherland, but I'm based in France
… I've been involved in W3C work for ~20 years, including the previous iteration of the DXWG
… I've been involved with RiccardoAlbertoni in the making of the Data Quality Vocabulary
… My interest in not so much about statistics, more about general dataset descriptions, aligning all the things that are floating around
… Coincidentally, I'm working on adapting Data Sheets for cultural heritage, need to talk with amerono
<antoine> pchampin: I'm on the W3C Team for over 5 years now
<antoine> ... team contact of previous DXWG and invited to Dagtsuhl seminar so worked on re-starting the WG with DDI
<antoine> ... I'm still going to be the team contact, and help with process and delivery of specs
Dan: Dan Gillman, retired from the US Bureau of Statistics
… involved with metadata things for many years; one of the key developers of the CDI standard under DDI Alliance
… I happened to be at the meeting where Data Cube was initially planned
… not much W3C experience
<simon> Dan was at Bureau of Labor Statistics
Dan: I hope to contribute as much as I can
FranckM: Franck Michel, engineer at CNRS (French Center for Scientific Research) and INRIA
… my background is in the Semantic Web community, thesis on Knowledge Graph construction
… W3C experience in the Knowledge Graph Community Group
… some experience in publishing RDF Knowledge Graph, using DCAT, VOID, SPARQL service descriptions, Data Cube
… I met some of you in Dagstuhl last year; my interest is to make datasets truly actionable
… with accurate complete description of the variables
… I will be an editor of the VVD specification
csarven: Sarven Capadisli, member of the W3C Technical Architecture Group
… I have been an editor of various W3C specifications, I worked in the Government Linked Data group that published the Data Cube vocabulary
… worked on publishing various stats datasets via https://
… I also contributed to the DDI RDF vocabulary
… I organized workshop on Semantics Statistics; I joined this group because it has long been a passion of mine
… I want to make these things more tangible for end-users in the context of credibility assessment; linked to my project Dokieli
… https://
… I'd be happy to contribute as an editor of Data Cube
<csarven> I'm https://
RiccardoAlbertoni: Ricardo Albertoni, Institude of Maths and Technology of the Italian CNR
… involved in Knowledge Graphs, reproducibility in AI context
… I have been one of the editors of DCAT
… I'm also one of the editors of DQV
<csarven> s|Capadisli |Capadisli - https://
RiccardoAlbertoni: I expect to contribute to the maintenance of DCAT
… as well as to update the Data Quality Vocabulary
… I'm open to collaborate and contribute to other deliverables
simon: Simon Cox, retired from the Australian Applied Research org
… I've continued to doubled on a lot of things, inc. the W3C revision of the SSN ontology
… also a co-editors of DCAT 2 and 3 (although Riccardo was the driving force in DCAT 3)
… my background is in natural sciences, mostly environmental and geological
… my interest is in keeping DCAT and the associated standards connected to the work done in natural sciences
… I noticed some omissions around the VVD input documents, I'll put them on the table
<pchampin> +1 that's what the WG is about :)
pwin: Peter Winstanley, based in Scotland
<simon> I-ADOPT from RDA and the envsci community
pwin: I'm very grateful to everybody here; I co-chaired the DXWG for a number of years, I'm delighted to see people taking over
… I worked on government data, big data cube, resisting to use DCAT
… other member states in the EU were moving; nice to see the US government pick up DCAT 3
… then I worked several years with Semantic Arts
… hopefully my experience gathered there can be useful for the group; business faces the same sort of challenges as government w.r.t. data datalogs
… academic papers sometimes ignores things that matter in the business world (e.g. Dolce vs. Gist)
… people who don't necessarily know much about RDF have influence in enterprises
… we need to get penetration in the core parts of society, not only academia
<Dan> Standards we work on re voluntary. How do we "guarantee" adoption?
pwin: e.g. make something as pervasive as USB-C
nicholascar: Nicholas Car, based in Brisbane, Australia
… I used to be in simon's research group, moved in the private sector
… I was involved in the previous DX WG, also as an implementor in the PROV and the Sparial Data on the web WGS
… currently a co-chair of the Data Shapes EG
<csarven> s|Semantic Statistics https://
nicholascar: link between SHACL-profiling and the Profile deliverables of this group
… also involved in the OGC and the ISO
… I have a strong interest in the Profile deliverable, but also in cataloguing in general
… I have prepared a bit of a plan for moving from tentative deliverables to real deliverables, with people ready to work as editors
… keen to share that in a future meeting
Joaquin: Joaquin Luciano Salvachua Rodriguez, Polytechnic university of Madrid
… working on semantic interoperability, with DCAT (although I was not active in this group)
… I have been more active in the ODRL group
Jakub: Jakub Klimek, working with the Czec government
… on the government catalogs with DCAT, DCAT-AP and our specific profile of that
… working on that I realized that the specification is not as machine readable as it could be
… one of my goal is to improve the way DCAT (but possibly other specs) can be machine readable
… I recently joined Semic, working on Geo-DCAT-AP and Stat-DCAT-AP
… Also some overlap with the Data-Cube vocabulary
FranckCotton: this took longer than I expected, but it was really interesting to get everybody's perspective
recurring meeting slot
FranckCotton: we had a poll for the timeslot of the recurring meeting
… it is obviously hard to find a slot that work for everyone
… the least bad option seems to be Thursday 10am Boston time
… not ideal for people in Australia, but as discussed earlier, we can't expect to have everybody all the time
… I suggest we start with a weekly meeting, we might cancel some calls during the summer
which would bring the next meeting on 18 June
<simon> 10am Boston will be midnight here, so I unfortunately won't be able to participate in meetings.
<simon> In the other season it will be 02:00 so even worse ...
pchampin: W3C meetings are anchored in Boston time, which matters for Daylight Saving Time. Use the W3C calendar https://
<simon> (not expecting to be heavily involved so my preferences are secondary)
FranckCotton: we should also plan a physical F2F, possibly during TPAC in Dublin in October
<nicholascar> W3C should use Brisbane time - no daylight savings here!
FranckCotton: I sent a Doodle, please respond to this
antoine: about the online calls; as Franck said, this is an important topic for us chairs
… we wouldn't want to have a vicious circle where people's involvement decreases because their choices are not considered
… we are still figuring this out; this will not be easy
… but we don't want to exclude anyone
<pwin> thanks Franck!
FranckCotton: thank you everyone for participating; see you on 18 June
<simon> SSN group meets at 11:00 UTC
<simon> then everyone switches only one hour with the season
<simon> bye