Introduction:
I am an Enterprise Data Architect. We want to exchange our data catalog with other entities, so we need to store our metadata in a machine-readable format. The W3C DCAT 3.0 standard is particularly suitable for this.
I am looking for an Alteryx workflow example that can write the metadata of a dataset to a DCAT 3.0 RDF file.
Need:
The workflow must perform the following steps:
Context:
The DCAT 3.0 standard is described here:
https://www.w3.org/TR/vocab-dcat-3/
The Field Info tool is your best friend: https://help.alteryx.com/current/en/designer/tools/developer/field-info-tool.html
As for DCAT RDF, I am not sure what it looks like. Do you have a sample? If it is plain text (delimited in some way, or readable in Notepad++), you can use the Output tool, choose ".csv" as the file type, and change the extension in the output path to .rdf or whatever extension is appropriate.
Below is an example of a DCAT description of a Customer dataset, together with a definition of the Customer concept used in the dataset.
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix dct: <http://purl.org/dc/terms/> .
@prefix dcat: <http://www.w3.org/ns/dcat#> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .

<https://example.com/customer-dataset>
    a dcat:Dataset ;
    dct:title "Customer Dataset" ;
    dct:description "A dataset containing information about customers" ;
    dcat:keyword "customer", "sales", "marketing" ;
    dct:publisher <https://example.com/organization> ;
    dcat:distribution [
        a dcat:Distribution ;
        dcat:downloadURL <https://example.com/customer-dataset.csv> ;
        dcat:mediaType <https://www.iana.org/assignments/media-types/text/csv> ;
        dct:format [
            a dct:MediaTypeOrExtent ;
            rdfs:label "CSV" ;
            rdfs:comment "File extension: .csv"
        ]
    ] ;
    dcat:landingPage <https://example.com/customer-dataset> ;
    dct:license <https://creativecommons.org/licenses/by/4.0/> ;
    dcat:theme [
        a skos:Concept ;
        skos:prefLabel "Customer Management"
    ] ;
    dct:subject [
        a skos:Concept ;
        skos:prefLabel "Customer" ;
        skos:broader [
            a skos:Concept ;
            skos:prefLabel "Person"
        ]
    ] .
Here is a breakdown of the DCAT file:
Okay then my previous response may not be valid anymore. DCAT RDF's output file type looks very foreign to me, and I don't think my recommendation works.
Can this be done with Python or R? If yes, you can port over the script into Alteryx via the R or Python tools and output accordingly as well.
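To give an idea of the Python route, here is a rough sketch that builds a small DCAT description as Turtle text using only the standard library. Everything in it is an assumption for illustration: the `customer` dict, the helper names `turtle_literal`, `dataset_to_turtle` and `write_turtle`, and the choice of properties are hypothetical, and plain string formatting stands in for a proper RDF library such as rdflib, which would likely be more robust.

```python
# Sketch only: renders dataset metadata as DCAT Turtle with string formatting.
# The metadata dict and all helper names below are hypothetical.

PREFIXES = {
    "dct": "http://purl.org/dc/terms/",
    "dcat": "http://www.w3.org/ns/dcat#",
}


def turtle_literal(text):
    """Quote a string as a Turtle literal, escaping backslashes, quotes and line breaks."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return '"' + escaped + '"'


def dataset_to_turtle(meta):
    """Render one dataset description (a plain dict) as a Turtle document."""
    lines = [f"@prefix {prefix}: <{iri}> ." for prefix, iri in PREFIXES.items()]
    lines.append("")
    lines.append(f"<{meta['uri']}>")
    lines.append("    a dcat:Dataset ;")
    lines.append(f"    dct:title {turtle_literal(meta['title'])} ;")
    lines.append(f"    dct:description {turtle_literal(meta['description'])} ;")
    # Each keyword becomes its own literal, separated by commas.
    keywords = ", ".join(turtle_literal(k) for k in meta["keywords"])
    lines.append(f"    dcat:keyword {keywords} ;")
    lines.append(f"    dct:publisher <{meta['publisher']}> ;")
    lines.append("    dcat:distribution [")
    lines.append("        a dcat:Distribution ;")
    lines.append(f"        dcat:downloadURL <{meta['download_url']}>")
    lines.append("    ] .")
    return "\n".join(lines) + "\n"


def write_turtle(text, path):
    """Write the Turtle text to a file; .ttl is the usual extension for Turtle."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(text)


# Hypothetical metadata, mirroring the Customer example in this thread.
customer = {
    "uri": "https://example.com/customer-dataset",
    "title": "Customer Dataset",
    "description": "A dataset containing information about customers",
    "keywords": ["customer", "sales", "marketing"],
    "publisher": "https://example.com/organization",
    "download_url": "https://example.com/customer-dataset.csv",
}

turtle_text = dataset_to_turtle(customer)

if __name__ == "__main__":
    print(turtle_text)
```

Calling `write_turtle(turtle_text, "customer-dataset.ttl")` would save the result to disk. Inside the Alteryx Python tool the dict would presumably be filled from the incoming data stream rather than hard-coded, and the output is worth checking with an RDF validator before exchanging it.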
Not sure if relevant: https://community.alteryx.com/t5/Alteryx-Server-Discussions/Is-it-possible-to-execute-Alteryx-workfl...
Thanks for your support, I really appreciate it. I'm a little surprised that there is no information on RDF formats available in Alteryx. DCAT and SKOS are W3C standards and widely used.
Perhaps someone from Alteryx can chip in here. This is beyond what I can answer as a community member, but if others have experience with it, their opinions carry more weight than mine in this case.
Alternatively, if Alteryx really doesn't have anything on this, you can suggest this as an idea. You can also go through your CSM as part of the Voice of the Customer initiative they have. Your CSM or even Alteryx assigned engineer is better able to answer you in that regard.
Hope this helps somewhat @pgrooten
Thanks! Good advice! I will contact my CSM for this.