W3C

– DRAFT –
FHIR RDF

08 May 2025

Attendees

Present
David Booth, Detlef Grittner, Erich Bremer
Regrets
-
Chair
David Booth
Scribe
dbooth

Meeting minutes

Complex Data Types

https://www.lassila.org/publications/2024/HartigEtAl_SPARQLCDTs_PosterPaper2024.pdf

erich: Complex Data Types (CDT) is implemented in Jena
… And RDF lists can be turned into CDT arrays.
… CDT is a proposal
… We also need a standardbinary representation of RDF.
… RDF HDT allows query on it.
… They treat all literals as a string, which doesn't work well for a lot of numeric data, but it's very fast.
… Great for huge amounts of data.
… I used it with Apache Arrow that I made.
… HDF5 is use in the scientific world, like an intelligent zip file. Chunks can be compressed, but you can pull out parts.
… I'm putting the guts of HDT into HDF5.
… OpenLink Software won't touch it if it isn't a standard.
… I harvested all of the imaging commons data and it was 90 TB of data, and pulled out all the metadata in a week.
… I'm working on a SOLID project -- Java library that wraps a read/write storage for linked web storage.
… Want to use it for image annotations and displays.

dbooth: Any SPARQL implementatoins of CDT yet?

erich: Only the Jena SPARQLstore.

erich: CDT paper: https://awslabs.github.io/SPARQL-CDTs/spec/latest.html
… also this: https://www.amazon.science/publications/datatypes-for-lists-and-maps-in-rdf-literals
… I also suggested allowing JSON paths

dbooth: Community group is a good step toward official W3C working group

erich: Need to remove redundancy in the file

detlef: From a theoretical standpoing, the hierarchical approach in DICOM is the right way to do it
… You have to connect to the UIDs of the entities

https://dl.gi.de/items/6ae82b4a-c2c8-4d7e-b45b-088e82080f99

erich: The HDT source code is GPL. I'd rather have it MIT or Apache 2 license.
… but it squeezes the data down a lot.
… I doesn't need to be serialized/deserialized. Like a direct memory copy.
… HDF5 is used for a lot of learning models.
… The ordering of the HDT data is not currently usable.
… I'm stuffing HDT data into HDF5.

ADJOURNED

Minutes manually created (not a transcript), formatted by scribe.perl version 244 (Thu Feb 27 01:23:09 2025 UTC).

Diagnostics

Succeeded: i/hierar/erich: Need to remove redundancy in the file

No scribenick or scribe found. Guessed: dbooth

Maybe present: dbooth, detlef, erich

All speakers: dbooth, detlef, erich

Active on IRC: dbooth