12:56:29 RRSAgent has joined #dpvcg 12:56:29 logging to https://www.w3.org/2021/02/17-dpvcg-irc 12:56:36 ScribeNick: harsh 12:56:40 Meeting: DPVCG Meeting Call 12:56:43 Chair: harsh 12:56:52 Date: 17 02 2021 12:57:09 Agenda: https://lists.w3.org/Archives/Public/public-dpvcg/2021Feb/0006.html 12:58:25 rrsagent, set logs world-visible 13:00:58 Present: paul, delaram, rana 13:01:05 Present: harsh 13:04:00 Present: beatriz, nishad 13:04:07 Topic: Open Issues/Actions on W3C tracker 13:07:20 Present: georg 13:09:24 see issue/tracker https://www.w3.org/community/dpvcg/track/issues/open 13:10:57 Topic: Presentation by Rana 13:11:04 On: Building a Corpus of Physical Health Data Disclosure on Twitter during the Covid-19 Pandemic 13:12:42 rana: users share personal data in social media networks (OSN) which has implications regarding privacy threats 13:13:16 rana: NLP can help with detecting such threats, more specifically: discrimination in job searchers, harassment, bullying, identity theft, misuse of health information 13:14:55 rana: detection of persona ldata or PII is a (current and topical) challenge 13:15:22 rana: PHDD: A Corpus of Physical Health Data Disclosure on Twitter during COVID-19 13:16:03 rana: tweets wre collected using keywords, hastags, regex, and were tagged based on criteria regarding health information or subject 13:17:34 rana: we published corpus in RDF and JSON; for RDF we created lightweight ontology "privacy tags for health information" 13:18:07 rana: DPV provides broad categories for personal data and health categories (physical, mental health) 13:18:28 rana: used HL7 concepts regarding confidentiality and sensitivity 13:19:24 Health Level Seven International (HL7) is a standard for health data https://www.hl7.org/ 13:21:05 rana: descriptive blog with example at https://ranasaniei.now.sh/posts/corpus 13:22:03 rana: future work includes use of supervised ML techniques for detection of health sensitive information, to notify users if shared content are sensitive, implement fine-grained access control mechanism 13:22:44 Q&A / discussion 13:24:21 beatriz: why did you not use the HL7 concepts to tag the tweets? 13:24:57 rana: for this work, we focused on the data subject; whereas the concepts from HL7 regarding sensitivity or confidentiality are relevant for information providers 13:25:44 beatriz: how do you measure confidentiality? 13:26:59 rana: based on contents or related concept in sensitive data categories e.g. special categories in GDPR 13:28:10 paul: why did you break up the text into nouns, pronouns, etc.? 13:29:39 rana: this information is relevant and useful for NLP tasks 13:29:58 harsh: regarding HL7, is this the same as FHIR, or are they tw o separate things? 13:30:24 rana: we used HL7 v4 privacy and security ontology 13:30:54 harsh: do you think we should take up some concepts from HL7 related to privacy/sensitivity? 13:31:08 rana: HL7 is more focused on healthcare, whereas DPV is more generic 13:35:58 Topic: terms proposed by Beatriz and Rana 13:36:03 See https://lists.w3.org/Archives/Public/public-dpvcg/2021Feb/0005.html 13:36:51 beatriz: Rana and myself have been analysing fitness apps policies, and we extracted terms missing in DPV 13:38:57 present: marklizar 13:39:14 There are a lot of fine-grained concepts which might not be suitable to DPV 13:39:20 Date of birth is relevant to age 13:39:40 Some others are not immediately relevant e.g. water intake as they can be broken down into more related concepts e.g. number of glasses 13:49:23 We seem to have a gap in concepts regarding Health, Medical Health, physical health, etc. and the concepts proposed 13:53:31 We need to figure out the proper structure for this in terms of hierarchy 13:53:34 Topic: next meeting 13:54:05 We will move to having a meeting next week 24-FEB 13:00 WET / 14:00 CET 13:54:15 Tentative topic for the agenda is continued discussion on proposed topics 13:54:31 Topic: RDF Vocab tool by Nishad 13:55:17 nishad: https://github.com/zazuko/rdf-vocabularies tool for working with ontologies as datasets / prefixes 13:55:41 nishad: I have added DPV to the tool, so all terms from DPV show up within the term/vocab finder 13:58:08 nishad: so when we update vocabulary, the urls must stay same for the updates 13:59:43 zakim, bye 13:59:43 leaving. As of this point the attendees have been marklizar 13:59:43 Zakim has left #dpvcg 13:59:52 rrsagent, publish minutes v2 13:59:52 I have made the request to generate https://www.w3.org/2021/02/17-dpvcg-minutes.html harsh 14:00:52 rrsagent, bye 14:00:52 I see no action items