{"id":33,"date":"2021-02-07T17:21:01","date_gmt":"2021-02-07T17:21:01","guid":{"rendered":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/?p=33"},"modified":"2021-02-07T17:21:02","modified_gmt":"2021-02-07T17:21:02","slug":"w3c-prov-some-interesting-extensions-to-the-core-standard","status":"publish","type":"post","link":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/2021\/02\/07\/w3c-prov-some-interesting-extensions-to-the-core-standard\/","title":{"rendered":"W3C PROV: some interesting extensions to the core standard"},"content":{"rendered":"\n<p>The W3C Provenance Working Group defines provenance as the \u201cinformation about entities, activities and people involved in producing a piece of data or thing,  which can be used to form assessments about its quality, reliability or trustworthiness\u201d. The PROV family of specifications [1], Data Model  [2], and OWL ontology (PROV-O) embody this broad concept into a set of W3C Recommendations for use by organisations across data and applications domains .<\/p>\n\n\n\n<p>[1] <a href=\"https:\/\/www.w3.org\/TR\/2013\/NOTE-prov-overview-20130430\/\">https:\/\/www.w3.org\/TR\/2013\/NOTE-prov-overview-20130430\/<\/a><br>[2] <a href=\"https:\/\/www.w3.org\/TR\/prov-dm\/\">https:\/\/www.w3.org\/TR\/prov-dm\/<\/a><br>[3] <a href=\"https:\/\/www.w3.org\/TR\/2013\/REC-prov-o-20130430\/\">https:\/\/www.w3.org\/TR\/2013\/REC-prov-o-20130430\/<\/a><\/p>\n\n\n\n<p>Since 2013, when the model was finalised, a number of organisations around the world have indeed <em><strong>adopted,  embedded and in some cases, extended PROV<\/strong><\/em> as part of their metadata vocabulary, to add value to data repositories by adding support for data exploration and discovery, quality assurance, and  more.<\/p>\n\n\n\n<p>PROV is, by design, a domain-agnostic data model and ontology, but it also provides well-defined extension points that can be used to seamlessly add and integrate domain-specific provenance concepts into the core model. <br>Here we list a small selection of these extensions and the organisations that are responsible for them.<\/p>\n\n\n\n<p>Disclaimer: List below is very partial, in progress, and periodically updated.<\/p>\n\n\n\n<div class=\"toc\">\n<hr>\n<p><H2>List of entries<\/H2><\/p>\n<ol>\n<li><a href=\"#GDPR\">The GDPR Provenance ontology (Trinity College Dublin)<\/a><\/li>\n<li><a href=\"#hl7\">Health Level Seven International (HL7)\u00a0Fast Healthcare Interoperability Resources (FHIR), part of NHS Digital UK<\/a><\/li>\n<li><a href=\"#provone\">The ProvONE ontology (DataONE, USA)<\/a><\/li>\n<li><a href=\"#provcare\">ProvCaRE (USA)<\/a><\/li>\n<li><a href=\"#proms\">Management System Provenance Ontology (CSIRO)<\/a><\/li>\n<li><a href=\"#agr\">The Agreements Ontology (CSIRO)<\/a><\/li>\n<li><a href=\"#dec\">The Decision Provenance (DecPROV) ontology (CSIRO)<\/a>\n<li><a href=\"#dbpedia\">DBPedia DataId<\/a><\/li>\n<li><a href=\"#org\">The organisation ontology<\/a><\/li>\n<li><a href=\"#p-plan\">The P-Plan ontology<\/a><\/li>\n<li><a href=\"#pav\">PAV: Provenance, Authoring and Versioning<\/a><\/li>\n<li><a href=\"#donto\">Donto: Dataset ontology<\/a><\/li>\n<li><a href=\"#sao\">The Stream Annotation ontology<\/a><\/li>\n<li><a href=\"#aml\">AutomationML Ontology\u00a0(aml)<\/a><\/li>\n<li><a href=\"#allo\">The Allotrope Foundation Data Format (ASDF)<\/a><\/li>\n<\/ol>\n<\/div>\n<hr>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"GDPR\">The GDPR Provenance ontology<\/h2>\n\n\n\n<p>GDPRov (pronounced GDPR-Prov) is a linked data ontology for expressing provenance of consent and data lifecycles with a view towards documenting compliance. GDPRov is an OWL2 ontology that extends PROV-O and P-Plan to model the provenance. <a href=\"https:\/\/openscience.adaptcentre.ie\/ontologies\/GDPRov\/docs\/ontology\">https:\/\/openscience.adaptcentre.ie\/ontologies\/GDPRov\/docs\/ontology<\/a><br><strong>Domain<\/strong>: EU Data protection regulations<br>Credit: Trinity College Dublin<br>Latest updated: 2019<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hl7\">Health Level Seven International (HL7)&nbsp;Fast Healthcare Interoperability Resources (FHIR), part of NHS Digital UK<\/h2>\n\n\n\n<p>From the description on the page linked here below: &#8220;Provenance statement indicates clinical significance in terms of confidence in authenticity, reliability, and trustworthiness, integrity, and stage in lifecycle (e.g. Document Completion &#8211; has the artifact been legally authenticated), all of which may impact security, privacy, and trust policies.&#8221;<br>&#8220;The Provenance resource is based on the&nbsp;<a href=\"http:\/\/www.w3.org\/TR\/2013\/NOTE-prov-overview-20130430\/\">W3C Provenance specification&nbsp;<\/a>, and mappings are provided. The Provenance resource is tailored to fit the FHIR use-cases for provenance more directly&#8221;<br><strong>Domain<\/strong>: Healthcare<br><a href=\"https:\/\/www.hl7.org\/fhir\/provenance.html\">https:\/\/www.hl7.org\/fhir\/provenance.html<\/a><br>Credit: HL7 , NHS Digital<br>Latest update: v4.0.1<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"provone\">The ProvOne ontology for scientific workflow provenance<\/h2>\n\n\n\n<p>ProvONE, a standard for scientific workflow provenance representation. ProvONE is defined as an extension of the W3C recommended standard PROV, aiming to capture the most relevant information concerning scientific workflow computational processes, and providing extension points to accommodate the specificities of particular scientific workflow systems.<br><a href=\"https:\/\/purl.dataone.org\/provone-v1-dev\">https:\/\/purl.dataone.org\/provone-v1-dev<\/a><br>Credit: <a href=\"https:\/\/dataone.org\/\">DataONE<\/a>. Working Group Chairs: Prof. B. Ludaescher (UI, USA) and Prof. P. Missier (Newcastle University, UK)<br>Latest updated: 2016<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"provcare\">ProvCaRE (USA)<\/h2>\n\n\n\n<p>The Provenance for Clinical and Healthcare Research (ProvCaRe) framework aims to model, extract, and analyze provenance information to support reproducibility of research studies. The ProvCaRe framework includes the S3 Model that extends the World Wide Web Consortium (W3C) PROV specifications to model provenance metadata describing Study Method, Study Tools, Study Data in a research study. The S3 Model is formalized in the ProvCaRe ontology.<br>https:\/\/provcare.case.edu\/#about<br><strong>Domain<\/strong>: Healthcare<br>Credit: S. Sahoo, funded by a NIH &#8220;Big Data to Knowledge&#8221; grant<br>Latest update: 2019<\/p>\n\n\n\n<p>Paper reference: Satya S. Sahoo, Joshua Valdez, Matthew Kim, Michael Rueschman, Susan Redline, ProvCaRe: Characterizing scientific reproducibility of biomedical research studies using semantic provenance metadata, International Journal of Medical Informatics, Volume 121, 2019, Pages 10-18, ISSN 1386-5056, https:\/\/doi.org\/10.1016\/j.ijmedinf.2018.10.009.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">PROV extensions at CSIRO (Australia)<\/h2>\n\n\n\n<p>CSIRO, Australia&#8217;s national science agency, maintains a number of small PROV extensions, as follows.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"proms\">Management System Provenance Ontology (PROMS): <\/h3>\n\n\n\n<p>A small specialisation of the&nbsp;<a href=\"https:\/\/www.w3.org\/TR\/prov-o\/\">PROV ontology<\/a>&nbsp;used to structure, bundle validate and ultimately manage reports of provenance generated by workflow-like systems. <a href=\"https:\/\/promsns.org\/def\/proms\/proms.html\">https:\/\/promsns.org\/def\/proms\/proms.html<\/a> <br><strong>Domain<\/strong>:<em> <\/em>system management<br>Credit: Nicholas J. Car<br>Latest updated: 2016<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"agr\">The Agreements Ontology<\/h3>\n\n\n\n<p>Designed to model &#8216;agreements&#8217; which are social contracts that include licenses, laws, contracts, Memoranda of Understanding, standards and definitional metadata. Its purpose is to support data sharing by making explicit the relationships between agreements and data and agreements and Agents (people and organisations).<br><a href=\"https:\/\/promsns.org\/def\/agr\/agr.htmlhttps:\/\/lov.linkeddata.es\/dataset\/lov\/vocabs\/agr\" data-type=\"URL\" data-id=\"https:\/\/promsns.org\/def\/agr\/agr.htmlhttps:\/\/lov.linkeddata.es\/dataset\/lov\/vocabs\/agr\">https:\/\/promsns.org\/def\/agr\/agr.html<\/a><br><a href=\"https:\/\/lov.linkeddata.es\/dataset\/lov\/vocabs\/agr\" data-type=\"URL\" data-id=\"https:\/\/lov.linkeddata.es\/dataset\/lov\/vocabs\/agr\">https:\/\/lov.linkeddata.es\/dataset\/lov\/vocabs\/agr<\/a><br><strong>Domain :<\/strong><em> <\/em>human decisions<br>Credit: Nicholas J. Car<br>Latest updated: 2016<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"dec\">The Decision Provenance (DecPROV) ontology<\/h3>\n\n\n\n<p>The <a href=\"http:\/\/promsns.org\/def\/do\">Decision ontology<\/a> (DO) provides basic means for describing decision and decision making. <br>DeCPROV is a re-worked version of the&nbsp;DO that aligns it with the&nbsp;<a href=\"https:\/\/www.w3.org\/TR\/prov-o\/\">PROV ontology<\/a>. This ontology is for modelling decisions and thus the causes for actions or the use or generation of things. It allows for a better understanding of&nbsp;<em>why<\/em>&nbsp;something might have taken place, have been used or produced than the more generic&nbsp;<a href=\"https:\/\/www.w3.org\/TR\/prov-o\/\">PROV ontology<\/a>, on which it is mainly based, does.<br><a href=\"https:\/\/promsns.org\/def\/decprov\/decprov.html\">https:\/\/promsns.org\/def\/decprov\/decprov.html<\/a><br>Domain of impact:document human decisions<br>Credit: Nicholas J. Car<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"dbpedia\">DBPedia DataId<\/h2>\n\n\n\n<p>The DBpedia DataID vocabulary is a meta-data system for detailed descriptions of datasets and their different manifestations, as well as relations to agents like persons or organizations, in regard to their rights and responsibilities.<br>The&nbsp;model integrates DCAT, VoID, <strong>Prov-O<\/strong> and FOAF. Extensions can&nbsp;be&nbsp;made for&nbsp;typical use&nbsp;cases.<br><a href=\"https:\/\/wiki.dbpedia.org\/projects\/dbpedia-dataid\">https:\/\/wiki.dbpedia.org\/projects\/dbpedia-dataid<\/a><br>Credit: Leipzig University, Dbpedia<br><strong>Domain<\/strong>:<em> w<\/em>eb \/ data management<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"org\"><strong>The Organization Ontology<\/strong><\/h2>\n\n\n\n<p>Org, the Core organization ontology, is a vocabulary for describing organisational structures, specialisable to a broad variety of types of organization. It is aimed at supporting linked data publishing of organizational information across a number of domains. It is designed to allow domain-specific extensions to add classification of organizations and roles, as well as extensions to support neighbouring information such as organizational activities.<br>Org builds upon the PROV-O Provenance Vocabulary [<a href=\"https:\/\/www.w3.org\/TR\/vocab-org\/#bib-prov-o\">prov-o<\/a>].<br><a href=\"https:\/\/www.w3.org\/TR\/vocab-org\/.\">https:\/\/www.w3.org\/TR\/vocab-org\/.<\/a><br>Credit: <a href=\"http:\/\/www.w3.org\/2011\/gld\/\">Government Linked Data Working Group<\/a>&nbsp;<br>Latest update: 2014<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"p-plan\"><strong>The P-Plan ontology<\/strong><\/h2>\n\n\n\n<p>P-Plan is an OWL2 ontology developed to describe abstract scientific workflows as plans and link them to their past executions. P-Plan extends the <strong>W3C PROV-O Ontology<\/strong>, which encodes the W3C PROV data model [<em>PROV-DM<\/em>]. PROV-DM describes the provenance of objects (prov:Entities) as a record of assertions about the steps (prov:Activities) that generated them and the entities used in those steps. Provenance describes past execution, but does not offer a vocabulary to express the&nbsp;plan&nbsp;that the execution was supposed to follow.<br>As an example, provenance vocabularies are appropriate for describing assays once they are executed, but are not designed to describe protocols. Therefore, in addition to the provenance record, it is often desirable to publish the plan that was followed during the execution. This would allow the provenance record to include what was envisioned would happen prior to the execution.<br><a href=\"https:\/\/www.opmw.org\/model\/p-plan\/\">https:\/\/www.opmw.org\/model\/p-plan\/<\/a><br><strong>Domain<\/strong>: Scientific workflows<br>Credit: Leipzig University (http:\/\/aksw.org\/Projects.html)<br>Latest updated: 2013<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"pav\">PAV: Provenance, Authoring and Versioning<\/h2>\n\n\n\n<p>PAV is a lightweight ontology for tracking Provenance, Authoring and Versioning. PAV specializes the W3C provenance ontology PROV-O in order to describe authorship, curation and digital creation of online resources.<br><a href=\"https:\/\/pav-ontology.github.io\/pav\/\">https:\/\/pav-ontology.github.io\/pav\/<\/a><br><strong>Domain<\/strong>: provenance of digital creations and of online resources<br>Credit:  Paolo Ciccarese<a href=\"http:\/\/orcid.org\/0000-0002-5156-2703\">, Novartis NIBR.<\/a> Stian Soiland-Reyes, University of Manchester<br>Latest update: 2015<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"donto\">Donto: Dataset ontology<\/h2>\n\n\n\n<p>The data.gov.au Dataset Ontology is an OWL ontology designed to describe the characteristics of datasets published on data.gov.au. <br>The <em>Dataset Ontology<\/em>  contains elements which describe the publication, update, origin, governance, spatial and temporal coverage and other contextual information about the dataset. The ontology also covers aspects of organisational custodianship and governance.<br>By using this ontology publishers increase discoverability and enable the consumption of this information in other applications\/systems as Linked Data. It further enables decentralised publishing of catalogs and facilitates federated dataset search across sites, e.g. in datasets that are published by the States.<br><a href=\"https:\/\/data.gov.au\/data\/dataset\/data-gov-au-dataset-ontology\/resource\/dc586d4f-a3a5-4e00-abb4-128277356bed\">https:\/\/data.gov.au\/data\/dataset\/data-gov-au-dataset-ontology\/resource\/dc586d4f-a3a5-4e00-abb4-128277356bed<\/a><br><strong>Domain<\/strong>: Government Open Data, Data Management<br>Credit: Australian Government<br>Latest update: 2017<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"sao\">The Stream Annotation ontology<\/h2>\n\n\n\n<p>Representing IoT data streams is an important requirement in semantic stream data applications, as well as in knowledge-based environments for Smart Cities. This ontology aims to semantically represent the features of a data stream defining the specifications of an information model on top of&nbsp;<a href=\"http:\/\/www.w3.org\/2005\/Incubator\/ssn\/ssnx\/ssn\">Semantic Sensor Networks (SSN)<\/a>,&nbsp;<a href=\"http:\/\/www.w3.org\/TR\/prov-o\/\">PROV-O<\/a>&nbsp;and&nbsp;<a href=\"http:\/\/motools.sourceforge.net\/timeline\/timeline.html\">TimeLine<\/a>&nbsp;Ontologies, and involves connections with the Complex Event Processing Ontology and Quality Ontology. <br><a href=\"http:\/\/iot.ee.surrey.ac.uk\/citypulse\/ontologies\/sao\/sao\">http:\/\/iot.ee.surrey.ac.uk\/citypulse\/ontologies\/sao\/sao<\/a><br><strong>Domain<\/strong>: Internet of Things \/ Smart Citites<br>Credit: University of Surrey<br>Latest update: 2016<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"aml\"><strong>AutomationML Ontology&nbsp;(aml)<\/strong><\/h2>\n\n\n\n<p>A vocabulary to represent the XML-based AutomationML Standard &#8211; IEC 62714.AutomationML, with additional support for the aspects of multisensor fusion and integration is proposed to create a complete knowledge-based solution&nbsp;for manufacturing and automation domains. Reference: Gon\u00e7alves, E., Freitas, A., &amp; Botelho, S. (2019). An AutomationML Based Ontology for Sensor Fusion in Industrial Plants.&nbsp;Sensors (Basel, Switzerland),&nbsp;19(6), 1311. https:\/\/doi.org\/10.3390\/s19061311<br><a href=\"http:\/\/i40.semantic-interoperability.org\/automationml\/Documentation\/index.html\">http:\/\/i40.semantic-interoperability.org\/automationml\/Documentation\/index.html<\/a><br><strong>Domain<\/strong>: manufacturing and automation<br>Credit: TU Vienna, Fraunhofer IAIS<br>Lastest update: 2016<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"allo\">The Allotrope Foundation Data Format (ADF)<\/h2>\n\n\n\n<p>The Allotrope Data Format Audit Trail and Electronic Signatures Specification [ADF-A] is a specification on how to use audit trails in the Allotrope Data Format [ADF] in a standardized way. The specification is both about the Allotrope Audit Trail Ontology and the Audit Trail API that is part of the [<a href=\"http:\/\/docs.allotrope.org\/TR\/adf-audit\/ADF%20Audit%20Trail.html#bib-ADF\">ADF<\/a>] APIs.<br><a href=\"http:\/\/docs.allotrope.org\/TR\/adf-audit\/ADF%20Audit%20Trail.html#bib-AF\">http:\/\/docs.allotrope.org\/TR\/adf-audit\/ADF%20Audit%20Trail.html#bib-AF<\/a><br><strong>Domain<\/strong>: Capturing data from laboratory analytical processes<br>Credit: The Allotrope foundation <a href=\"https:\/\/www.allotrope.org\">https:\/\/www.allotrope.org<\/a><br>Latest update: 2017<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The W3C Provenance Working Group defines provenance as the \u201cinformation about entities, activities and people involved in producing a piece of data or thing, which can be used to form assessments about its quality, reliability or trustworthiness\u201d. The PROV family of specifications [1], Data Model [2], and OWL ontology (PROV-O) embody this broad concept into &hellip; <a href=\"https:\/\/blogs.ncl.ac.uk\/paolomissier\/2021\/02\/07\/w3c-prov-some-interesting-extensions-to-the-core-standard\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">W3C PROV: some interesting extensions to the core standard<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":9037,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[3,6],"class_list":["post-33","post","type-post","status-publish","format-standard","hentry","category-uncategorised","tag-provenance","tag-prov-2"],"_links":{"self":[{"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/posts\/33","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/users\/9037"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/comments?post=33"}],"version-history":[{"count":1,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/posts\/33\/revisions"}],"predecessor-version":[{"id":34,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/posts\/33\/revisions\/34"}],"wp:attachment":[{"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/media?parent=33"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/categories?post=33"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ncl.ac.uk\/paolomissier\/wp-json\/wp\/v2\/tags?post=33"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}