IDN Catalogue Profile - Specification

This is the IDN Catalogue Profile specification that defines all elements of the profile's metadata model.

Venn diagram of this profile's concerns

Figure 1: This profile overlaps concerns with the DCAT and PROV models and provides for the description of data according to several other assessment models, such as FAIR, CARE etc. The overlaps do indicate which models are needed to cover requirements of the assessment models.

Metadata

URI
https://data.idnau.org/pid/cp/spec
Is Part Of
IDN Catalogue Profile
Publisher(s)
Indigenous Data Network
Creator(s)
Nicholas J. Car
Dates
Created 2022-03-18
Issued 2022-07-19
Modified 2024-07-02
Version
0.0.2
License
Attribution 4.0 International (CC BY 4.0)
Copyright
Indigenous Data Network, 2022 - 2024

Preamble

Abstract

This document is the formal data model specification of the IDN Catalogue Profile which is a profile of DCAT, the Data Catalog Vocabulary. Where DCAT is designed for the representation of data catalogues and their content in general, this profile of it aims to cater for the enhanced representation of indigenous data governance.

Most of the rules of this profile just implement conventions for the use of DCAT and introduce few or no new model elements.

Related parts of the IDN Profile:

For the list of all resources within this profile, see the profile definition:

Namespaces

This document refers to elements of various ontologies by short codes using namespace prefixes. The prefixes and their corresponding namespaces' URIs are:

idncphttps://data.idnau.org/pid/cp
dcathttp://www.w3.org/ns/dcat#
dctermshttp://purl.org/dc/terms/
owlhttp://www.w3.org/2002/07/owl#
profhttp://www.w3.org/ns/dx/prof/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
rdfshttp://www.w3.org/2000/01/rdf-schema#
ricohttps://www.ica.org/standards/RiC/ontology#
rolehttp://www.w3.org/ns/dx/prof/role/
sdohttps://schema.org/
skoshttp://www.w3.org/2004/02/skos/core#
xsdhttp://www.w3.org/2001/XMLSchema#

Conformance

The key words MAY, MUST, MUST NOT, and SHOULD are to be interpreted as described in RFC2119.

The Examples in this document are snippets of RDF data formulated according to the Turtle syntax.

1. Introduction

This introduction section provides direct answers to likely early questions about this profile, specifically:

  1. What is the purpose of this Profile? See Purpose.
  2. What does basic use of this profile look like? See Basic Use.
  3. What minimum elements are required? See Minimum Metadata.
  4. What are all the recommended elements? See All Elements.
  5. What data formats can I use for metadata? See Formats.

1.1 Purpose

What is the purpose of this Profile?

Common catalogue models provide lots of possible metadata elements which can be used to describe catalogued items, to categorise them, indicate how they are managed and so on but don't specifically handle Indigenous concerns.

The purpose of this profile is to allow for the cataloguing of things created by or about Indigenous people or about Indigenous things and to be able to assess whether they are governed well, from an Indigenous perspective.

This Profile is just a recommendation to use a common catalogue model - DCAT, the Data Catalog Vocabulary - in particular ways which are called patterns - see the Patterns Section below.

1.2 Basic Use

What does basic use of this profile look like?

Profile class structure overview

Figure 2: This profile requires basic use of the DCAT model's elements for cataloguing with model elements from several other models for classification and relating catalogued resources to people and organisations. This is all common DCAT practice.

This IDN Catalogue Profile builds on the DCAT, the Data Catalog Vocabulary, which requires Catalog instances to contain Resource instances, and some DCAT-recommended ways of expressing relationships from resources to people & organisations - Agents - using PROV, the Provenance Ontology. Resources can be categorised using vocabulary terms defined using SKOS, the Simple Knowledge Organization System and a few special relationships are recommended to link resources to policies, such as data governance policies and licenses.

To cater for Indigenous data governance concerns, this profile requires, or at least expects/promotes the use of, specific catalogues of Agent instances and specific vocabularies of Concept instances for to classify Resource instances with.

So, when using regular DCAT, you might indicate that a Resource was created by a person identified by an ORCID, an international researcher identifier, using this profile you might indicate the creator using an identifier from the Indigenous Data Network's Agents Database which stores additional Indigenous-relevant information about people and groups.

Also, when using DCAT, you might just indicate the theme of data by classifying a Resource with a Fields of Research Code. When using this profile you might still do that but might also select a more nuanced Indigenous research topic from the IDN Themes vocabulary.

In addition to requiring the use of some DCAT-promoted patterns over others and recommended use of particular reference catalogues & vocabularies, this profile also constrains the ways in which several other DCAT properties should be used. All the constraints are still normal DCAT-style constraints.

1.3 Minimum Metadata

What minimum elements are required?

For the cataloguing of resources according to this profile, we need at least the following information:

Property Notes
Identifier

Either you have one, or we can create one for you.

See the Identifiers section below

Title Just plain text
Description A single sentence at least, up to multiple paragraphs. Just plain text though
Agents

Persons Organisations associated with the resource and the roles they play.

Creator
The ID of a Person or an Organisation in our IDN Data Agents DB
Custodian

As for Creator: the ID of a Person or an Organisation in our IDN Data Agents DB.

Can be the same as, or different to, Creator

Contact Point

The ID or details of a Person or a part of an Organisation that can be contacted regarding this resource.

This person/org does not have to be listed in any database such as the IDN Data Agents DB and may be automatically assigned to this resource.

Created/Published

The date/year etc. the data was created or published.

This property is recommended but optional: If you really don't know this, leave it out.

See the Dating section below

Modified

The date/year etc. the data was last modified/updated.

This property is recommended but optional: If you really don't know this, leave it out.

See the Dating section below

Themes (keywords)

Any number of preferably vocabulary concepts but also perhaps plain text themes/categories/keywords classifying the resource.

This property is recommended but optional.

See the Theming section below

Policy details

A reference (hyperlink/description) to any (Indigenous) data policy that pertains to this specific resource or the class of resource this is in. Includes access rights and licenses.

This property is recommended for governance assessment.

See the Policies section below

A catalogued resource with minimal metadata:

For this resource we have only:

Property Notes
Identifier https://trove.nla.gov.au/work/10128420
Type http://purl.org/dc/dcmitype/PhysicalObject
Title Tjukinya
Description Folktale: The little red hen retold in Pitjantjatjara
Agents
Creator
Roderick Munti
Publisher
Summer Institute of Linguistics
Custodian
National Library of Australia
Published 1980
Themes (keywords) Pitjantjatjara
Policy details
Access rights
Not for Inter-Library Loan

It has a simple title, brief description and a few Agents (organisations & persons) associated with it. This item is easily understood to be Indigenous as it is themed with an indigenous language - Pitjantjatjara - and controlled access is indicated.

This item will gain some form of FAIR and CARE scores, but they could be improved: see the next section.

1.4 All Elements

What are all the recommended elements?

In general, all the metadata elements that can be used in accordance with this profile are those of the DCAT specification. DCAT contains lots of attributes for lots of purposes, so most resources will not sensibly use all of them.

More specifically, there are other schemes, including profiles of DCAT, that recommend specific elements for use, beyond those listed in the previous section.

Here we make two specific recommendations for particular DCAT metadata elements:

  1. Australian government-aligned metadata
  2. Maximal Indigenous content representation

1.4.1 Australian government-aligned metadata

This recommendation is to use metadata elements aligned with the National Data Commissioner's Guide on Metadata Attributes.

In addition to the elements listed in the minimum metadata section above:

Property Notes
Distribution information

How to technically access this resource, for example:

  • an access URL
  • file format and data size
  • shelf number / physical collection location
  • distribution-specific instructions

See the Distribution Class details in the Model Section.

Temporal coverage

The time period covered by the data. can eb indicates as a date, date range, year range or named time period.

See the Temporality pattern section.

Spatial coverage

The spatial area covered by the data

Note this might be given as a geometry - point or polygon - but may best be given as a link to an online spatial object, e.g. administrative areas within the ASGS online.

See the Spatiality pattern section.

1.4.2 Maximal Indigenous content representation

This recommendation is to use the same metadata elements as listed above for minimal metadata and the National Data Commissioner's Guide on Metadata Attributes but with specific sources for particular values.

In addition to the elements listed in the minimum metadata section above:

Property Notes
Agents Use identifiers for Agents registered within the IDN's Agents DB. This ensures that the Indigenous relations of the Agent, if any, are known and can be used to assess whether they are a representative of the targets of the resource information, for governance purposes.
Themes

Use vocabularies specifically created to indicate Indigenous aspects of data, for example:

These vocabularies build on widely used general-purpose vocabularies, such as the ARDC's Fields of Research codes but contain detailed Indigenous modelling.

Policy details

The particular policies that are sought are:

  • Indigenous Data Governance Policy
  • general Data Governance Policy
  • data license

Best representation in metadata would be to link to online copies of the policies and to indicate their type from the IDN's Policy Types vocabulary

Spatial coverage

Indicating the spatial coverage of the resource, or its place of origin, with reference to a names spatial feature, in particular Indigenous spatial features, such as those provided by the Indigenous Data Network's reference Spatial Data Catalog.

1.5 Format

What data formats can I use for metadata?

The IDN Catalogue Profile uses the same machine-readable data format used by DCAT: Resource Description Framework (RDF).

RDF is widely supported and available in multiple syntaxes or "flavours" such as JSON and XML, as well as custom formats made for readability, such as Turtle.

It's critical that metadata is either supplied in one of the RDF syntaxes so that it can be validated with this profile's supplied validators - see the Validation Section.

Here follows 3 RDF syntax versions of the Tjukinya resource shown in the Minimum Metadata section above.

1.5.1 Turtle

Turtle is a text-based, reasonably human-readable syntax for RDF.

PREFIX dcat: <http://www.w3.org/ns/dcat#> PREFIX dcterms: <http://purl.org/dc/terms/> PREFIX prov: <http://www.w3.org/ns/prov#> PREFIX sdo: <https://schema.org/> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> <https://trove.nla.gov.au/work/10128420> a dcat:Resource ; dcterms:type <http://purl.org/dc/dcmitype/PhysicalObject> ; dcterms:title "Tjukinya" ; dcterms:description "Folktale: The little red hen retold in Pitjantjatjara" ; dcterms:creator <https://example.com/agent/123> ; dcterms:publisher <https://example.com/agent/123456> ; dcat:theme <https://data.idnau.org/pid/austlang/C6> ; prov:qualifiedAttribution [ prov:hadRole <https://linked.data.gov.au/def/data-roles/custodian> ; prov:agent <https://linked.data.gov.au/org/nla> ] ; dcterms:published "1980"^^xsd:gYear ; dcterms:theme <https://data.idnau.org/pid/austlang/C6> ; dcat:accessRights "Not for Inter-Library Loan" ; .

Note that the reference to the publisher, the Summer Institute of Linguistics, is by persistent identifier web address, https://example.com/agent/456, that resolves to more information about the Institute. This particular identifier is from the IDN Agents Database, so the IDN potentially knows more information about the Institute than just basic facts such as its name, for example whether it is an Indigenous-led organisation.

1.5.2 JSON-LD

JSON-LD is a convention for the use of JSON data that can represent RDF data. It is a modern syntax used by modern Web systems for data transfer.

This example JSON-LD has been converted from the Turtle data above and can be back-converted perfectly.

{ "@context": { "@vocab": "http://purl.org/dc/terms/", "accessRights": "http://www.w3.org/ns/dcat#accessRights", "theme": "http://www.w3.org/ns/dcat#theme", "Resource": "http://www.w3.org/ns/dcat#Resource", "year": "http://www.w3.org/2001/XMLSchema#gYear", "PhysicalObject": "http://purl.org/dc/dcmitype/PhysicalObject", "agent": "http://www.w3.org/ns/prov#agent", "hadRole": "http://www.w3.org/ns/prov#hadRole", "attribution": "http://www.w3.org/ns/prov#qualifiedAttribution", "austlang": "https://data.idnau.org/pid/austlang/C6", "data-roles": "https://linked.data.gov.au/def/data-roles/" }, "@graph": [ { "@id": "https://trove.nla.gov.au/work/10128420", "@type": "Resource", "type": { "@id": "http://purl.org/dc/dcmitype/PhysicalObject" }, "title": "Tjukinya", "description": "Folktale: The little red hen retold in Pitjantjatjara", "creator": { "@id": "https://example.com/agent/123" }, "publisher": { "@id": "https://example.com/agent/123456" }, "attribution": { "@id": "_:nb0e54b223f37485b8555e8d45b468bf9b1" } "published": { "@type": "year", "@value": "1980" }, "theme": { "@id": "https://data.idnau.org/pid/austlang/C6" }, "accessRights": "Not for Inter-Library Loan", }, { "@id": "_:nb0e54b223f37485b8555e8d45b468bf9b1", "agent": { "@id": "https://linked.data.gov.au/org/nla" }, "hadRole": { "@id": "data-roles:custodian" } } ] }

1.5.3 XML

XML is still commonly used as a format for metadata but care must be taken here: RDF/XML, a specific formulation of XML, must be used for XML data, not general-purpose XML.

The following RDF/XML has been generated from the Turtle example above.

<?xml version="1.0" encoding="utf-8"?> <rdf:RDF xmlns:dcat="http://www.w3.org/ns/dcat#" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:prov="http://www.w3.org/ns/prov#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" > <rdf:Description rdf:about="https://trove.nla.gov.au/work/10128420"> <rdf:type rdf:resource="http://www.w3.org/ns/dcat#Resource"/> <dcterms:type rdf:resource="http://purl.org/dc/dcmitype/PhysicalObject"/> <dcterms:title>Tjukinya</dcterms:title> <dcterms:description>Folktale: The little red hen retold in Pitjantjatjara</dcterms:description> <dcterms:creator rdf:resource="https://example.com/agent/123"/> <dcterms:publisher rdf:resource="https://example.com/agent/123456"/> <prov:qualifiedAttribution rdf:nodeID="n09d0151a1e6140cfb188576200eb9d69b1"/> <dcterms:published rdf:datatype="http://www.w3.org/2001/XMLSchema#gYear">1980</dcterms:published> <dcat:theme rdf:resource="https://data.idnau.org/pid/austlang/C6"/> <dcat:accessRights>Not for Inter-Library Loan</dcat:accessRights> </rdf:Description> <rdf:Description rdf:nodeID="n09d0151a1e6140cfb188576200eb9d69b1"> <prov:hadRole rdf:resource="https://linked.data.gov.au/def/data-roles/custodian"/> <prov:agent rdf:resource="https://linked.data.gov.au/org/nla"/> </rdf:Description> </rdf:RDF>

1.5.4 Other formats

All standard RDF formats may be used. As a guide, those able to be handled by the RDFLib code library are fine.

If you need to create RDF data for the first time, you can use the:

2. Patterns

TODO: intro what patterns are

2.1 Identifiers

This Profile relies on web address-style identifiers for things, so you will to provide, or ask us to create, something like this:

That is the identifier for the Koori Health Research Database dataset.

It's unique ID bit is its acronym 'KHRD' and the https://data.idnau.org/pid/ bit is what the IDN uses to make unique ID's into web addresses that work - are clickable.

Existing Identifiers

If you already have a web address for your data, use that. For example, the Australian Government Indigenous Programs & Policy Locations (AGIL) dataset is already listed in the data.gov.au catalogue at https://data.gov.au/data/dataset/34b1c164-fbe8-44a0-84fd-467dba645aa7, so we continue to use that web identifier. See our listing of that dataset.

2.2 Indigeneity

2.3 Roles

All Agents, that is Organisations & People, associated with resources such as datasets in IDN metadata need to have their role defined.

The roles we start with are those defined in the well-known ISO's Role Codes vocabulary.

This vocabulary contains standard roles such as:

  • author
  • co author
  • custodian
  • point of contact

To this vocabulary, we have added:

  • subject agent
  • subject agent representative

Some of these roles are present in the Full Example below.

In the technical versions of the examples below, you will see how pairs of Roles and Agents are grouped together in metadata so multiple Agents and Roles may be described for a single resource.

2.4 Agents

All Agents, that is Persons or Organisations, referred to by IDN metadata MUST be registered within the IDN Data Agents DB which is online at coming soon, Jan, 2023!

When registered, Agents are given a persistent identifier web address. Such identifiers may have been created elsewhere, or may be assigned by the Data Agents DB.

For example, here is the persistent identifier for the Australian Data Archive:

This identifier resolves - you can click on it and go to a web page - and it has been assigned by a party other than the IDN, in this case, the Australian Government Linked Data Working Group, which is a source fo many organisational persistent identifiers.

This identifier is present in the Full Example below.

Here is an example of an identifier for an Organisation supplied by the IDN:

Note the use of the IDN's namespace, https://data.idnau.org/.

If you enter a new Organisation or Person into the Data Agents DB, you will automatically be assigned a new persistent identifier of this form unless you quote an existing one.

2.5 Dating

For Created & Modified and all other dates, you are free to use any one of the following formats, but you do have to pick one and not make up your own format:

Type Format Example
Date yyyy-mm-dd 2021-02-28
Year yyyy 2021
Year/Month yyyy-mm 2021-02
Date/Time yyyy-mm-ddTHH:MM:SS 2021-02-28T14:02:23

Please do not use junk values like 00:00:00 for time: select a format that effectively conveys what you know about the data.

2.6 Theming

TODO

2.7 Policies

TODO

3. Model

Profile class structure overview

Figure 3: Overview of elements of the DCAT, PROV & SKOS models and relationships between them used in this profile. Classes of object are shown as chamfered rectangles except for Agent which is shown as a 'house'. Enclosed arrows indicate the rdfs:subClassOf relationship. Pink arrows indicate specific value properties.

3.1 Classes

3.2 Predicates

3.3 Axioms

4. Validation

The requirements for data to conform to this profile are listed here. They are organised into the following subsections:

Where model objects are indicated in code and no namespace is indicated, presume the dcat, prov or skos namespaces are used. There are no class/property overlaps between those models, so there can be no confusion if the object is looked up in those standards.

4.1 Structural Requirements

The following sub-subsections are per object class, where 'class' is one of the classes in this profile's structural overview (see Figure 2 above).

The general idea is that objects of the various class types in the overview diagram are needed for IDN metadata and this section describes their required properties and relations. For example, all Resource instances must be related to at least one Catalog instance.

2.1.2 Catalogue

Req C1: Being a Resource, each Catalog instance MUST also adhere to the Resource Requirements R1, R2, R3 & R4.

Req C2: Each Catalog instance MUST contain at least one Resource indicated by the hasPart property.

Example: A Catalogue instance with minimum required metadata <https://doi.org/a-doi-for-the-catalogue> a dcat:Catalog ; dcterms:title "Catalogue X" ; dcterms:description "An example catalogue"@en ; dcterms:created "2022-07-21"^^xsd:date ; dcterms:modified "2022-07-25T20:15:21"^^xsd:dateTime ; prov:qualifiedAttribution [ a prov:Attribution; prov:agent <https://linked.data.gov.au/org/idn> ; # Indigenous Data Network's PID ex:hadRole :custodian ; ] , [ a prov:Attribution; prov:agent <https://orcid.org/0000-0002-8742-7730> ; # Nicholas Car's IRI ex:hadRole :pointOfContact ; ] ; dcterms:hasPart ex:some-resource-y , ex:some-resource-z , ... .

2.1.3 Resource

Req R1: Each Resource instance SHOULD indicate a persistent identifier to be used to gain access to its point-of-truth metadata. The PID SHOULD be used as the Dataset instance's IRI, if it is an HTTP/HTTPS IRI, or else it SHOULD be quoted as a literal value, indicated with the property dcterms:identifier of datatype xsd:token.

Req R2: Each Resource instance MUST provide basic Resource metadata so that exactly one of each of the following properties is required with range value as per Resource requirements: title, description, created & modified.

Req R3: Allowed Semantic Web date/time properties for created & modified properties are xsd:date, xsd:dateTime, xsd:dateTimeStamp, time:Interval

Req R4: Each Resource instance MUST NOT indicate Agent roles with direct DCAT properties (e.g. publisher) and MUST indicate them with the DCAT-recommended PROV qualified attribution pattern with each prov:Attribution indicated with the property prov:qualifiedAttribution.

Req R5: Each Resource instance, if it is not a Catalog instance MUST and if it is a Catalog instance MAY indicate that it is within at least one Catalog instance with an in-bound hasPart property from a Catalog instance.

Req R6: Each Resource instance MAY indicate that it is a specific type of resource by use of the dcterms:type property. Catalogued Dataset instances are equivalent to catalogued Resource instances of dcterms:type Dataset.

Example: A Resource instance with minimum required metadata <http://example.com/resource/x> a dcat:Resource ; dcterm:identifier "CAT::2-3-4::X"^^xsd:token ; # Dummy catalogue number dcterms:title "Resource X" ; dcterms:description "An example Resource"@en ; dcterms:created "2022-07-21"^^xsd:date ; dcterms:modified "2022-07-25T20:15:21"^^xsd:dateTime ; prov:qualifiedAttribution [ a prov:Attribution; prov:agent <http://example.com/org/clc> ; # Example Indigenous org PID ex:hadRole :custodian ; ] , [ a prov:Attribution; prov:agent <http://example.com/academic/person-x> ; # An academic ex:hadRole :author ; ] , [ a prov:Attribution; prov:agent <http://example.com/org/xyz-people> ; # An Indigenous group ex:hadRole :subjectAgent ; ] ; dcterms:type ex:map ; # An example specialised resource - a map - from some vocabularty of types . <https://doi.org/a-doi-for-a-catalogue> # a catalogue indicating this Resource is a part of it dcterms:hasPart <http://example.com/resource/x> ; .

2.1.3 Dataset

There are no structural requirements specifically for Dataset instances: the requirements for Resource also apply to Dataset.

Note that there are values requirements for Dataset instnaces, as per the next section.

2.1.1 Attribution

Req A1: Each Attribution instance MUST indicate an Agent instance with the property prov:agent and a role for that Agent, as a skos:Concept, in relation to the attributing entity, with the hadRole property.

Example: A Resource with a qualified Attribution <http://example.com/resource/x> a dcat:Resource ; ... prov:qualifiedAttribution [ a prov:Attribution; prov:agent <http://example.com/resource/central-lands-council> ; dcat:hadRole :custodian ; ] ; ... .

2.1.5 Agent

Agents are the Organisations & People that have Roles in relation to Catalogues and Resources.

Req AG1: Each Agent instance MUST be either an sdo:Organization or a sdo:Person instance.

Req AG2: Each Agent instance MUST be described with at least the sdo:name property and, if an sdo:Organization, also an sdo:url property with a xsd:anyURI value or, if a sdo:Person, an sdo:email property with a xsd:anyURI value.

Req AG3: Each Agent instance SHOULD be described with a sdo:description property and, if the Agent has them, identifiers for it should be indicated with sdo:identifier with xsd:anyURI or xsd:token values.

Req AG4: Each Agent MAY relate other information using schema.org properties, for example sdo:affiliation to link a sdo:Person to an sdo:Organization.

Example: An Organization with an example Australian Business Number identifer and a Person affiliated with it <https://kurrawong.ai> a sdo:Organization ; sdo:identifier "31 353 542 036"^^ex:ABN ; sdo:name "KurrawongAI" ; sdo:description "KurrawongAI is a small, Artificial Intelligence, company in Australia specialising in Knowledge Graphs." ; sdo:url "https://kurrawong.ai"^^xsd:anyURI ; . <https://orcid.org/0000-0002-8742-7730> a sdo:Person ; sdo:name "Nicholas J. Car"@en ; sdo:email "nick@kurrawong.ai"^^xsd:anyURI ; sdo:affiliation <https://kurrawong.ai> ; .

2.1.5 Concept

This profile is a profile of Vocabulary Publications Profile, VocPub, among other specifications. VocPub sets requirements for Concept and ConceptScheme instances.

See the VocPub Concept requirements listed in its Specification:

Example: A Concept with basic VocPub requirements met <http://example.com/concept/x> a skos:Concept ; rdfs:isDefinedBy <http://example.com/concept-scheme/y> ; # Indicating the vocabulary that defines this term skos:prefLabel "Concept X"@en ; # The Concepts preferred label skos:altLabel "Xxx"@en ; # A synonym/alias skos:definition "An example Concept"@en ; # The Concept's definition skos:narrower <http://example.com/concept/x.1> ; .

2.1.6 ConceptScheme

This profile is a profile of Vocabulary Publications Profile, VocPub, among other specifications. VocPub sets requirements for Concept and ConceptScheme instances.

See the VocPub ConceptScheme requirements listed in its Specification's Vocabulary section:

4.2 Values Requirements

IDN metadata must be represented according to certain classes with certain properties, including relations between classes, as defined in the section above. This section defines the requirements for particular properties to indicate particular values. These values are mostly items in particular vocabularies and catalogues managed by the IDN itself and the result of these requirements is to ensure that IDN metadata is related to values understood by the IDN.

Req V1: Each Resource instance MUST be categorised with at least one value from the IDN Data Themes vocabulary, indicated with a theme property.

Req V2: The roles of Agent instances with relation to a Resource MUST be indicated with the hadRole property and selected from the IDN Role Codes Vocabulary.

Req V3: Each Agent instance referenced by an Attribution instance, if the Agent is Indigenous SHOULD be registered within the IDN Agents Catalogue.

Example: A Resource with a qualified Attribution <http://example.com/resource/x> a dcat:Resource ; ... dcat:theme idth:indigenous-demographics ; # Concept within the IDN Data Themes Vocabulary ... prov:qualifiedAttribution [ a prov:Attribution; # Example Agent that could be registered in the IDN Agents Catalogue prov:agent <http://example.com/resource/central-lands-council> ; dcat:hadRole :custodian ; # Role from the IDN Role Codes Vocabulary ] ; .

6. Extended Examples

6.1 Minimal

The machine-readable version of this metadata record is reproduced here in the RDF Turtle format:

PREFIX dcat: <http://www.w3.org/ns/dcat#> PREFIX dcterms: <http://purl.org/dc/terms/> PREFIX sdo: <https://schema.org/> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> <https://data.idnau.org/pid/10128420> a dcat:Resource ; dcterms:creator <https://example.com/agent/123> ; dcterms:description "Folktale: The little red hen retold in Pitjantjatjara" ; dcterms:published "1980"^^xsd:gYear ; dcterms:publisher <https://example.com/agent/123456> ; dcterms:title "Tjukinya" ; dcat:theme <https://data.idnau.org/pid/austlang/C6> ; prov:qualifiedAttribution [ prov:hadRole <https://linked.data.gov.au/def/data-roles/custodian> ; prov:agent <https://linked.data.gov.au/org/nla> ] ; .

Note that the reference to the publisher, the Summer Institute of Linguistics, is by persistent identifier web address, https://example.com/agent/456, that resolves to more information about the Institute. This particular identifier is from the IDN Agents Database, so the IDN potentially knows more information about the Institute than just basic facts such as its name, for example whether it is an Indigenous-led organisation.

This metadata record will not score well on most metrics for FAIR access, governance scores and so on since it's minimal, but it's better than nothing!

6.2 Formats

Metadata according to this Profile can be created using any RDF format.

Here is the same minimal metadata example as above shown there in HTML and the Turtle RDF format. This form is RDF's JSON-LD format, which is standard JSON:

{ "@context": { "@vocab": "http://purl.org/dc/terms/", "dcat": "http://www.w3.org/ns/dcat#", "austlang": "https://data.idnau.org/pid/austlang/", "xsd": "http://www.w3.org/2001/XMLSchema#" }, "@id": "https://data.idnau.org/pid/10128420", "@type": "dcat:Resource", "title": "Tjukinya", "description": "Folktale: The little red hen retold in Pitjantjatjara", "creator": { "@id": "https://example.com/agent/123" }, "publisher": { "@id": "https://example.com/agent/123456" }, "published": { "@type": "xsd:gYear", "@value": "1980" }, "dcat:theme": { "@id": "austlang:C6" } }

Here is the same minimal metadata example again in RDF's XML format:

<?xml version="1.0" encoding="utf-8"?> <rdf:RDF xmlns="http://purl.org/dc/terms/" xmlns:dcat="http://www.w3.org/ns/dcat#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" > <rdf:Description rdf:about="https://data.idnau.org/pid/10128420"> <title>Tjukinya</title> <description>Folktale: The little red hen retold in Pitjantjatjara</description> <rdf:type rdf:resource="http://www.w3.org/ns/dcat#Resource"/> <creator rdf:resource="https://example.com/agent/123"/> <publisher rdf:resource="https://example.com/agent/123456"/> <published rdf:datatype="http://www.w3.org/2001/XMLSchema#gYear">1980</published> <dcat:theme rdf:resource="https://data.idnau.org/pid/austlang/C6"/> </rdf:Description> </rdf:RDF>

The JSON & XML formats shown here can be converted to and from the RDF Turtle format shown in the minimal section above, and each other, using RDF format conversion tools such as KurrawongAI's RDF Converter.

Example: Partial content of the IDN Dataset Catalogue PREFIX dcat: <http://www.w3.org/ns/dcat#> PREFIX dcterms: <http://purl.org/dc/terms/> PREFIX idnth: <https://data.idnau.org/pid/vocab/idn-th> PREFIX droles: <https://linked.data.gov.au/def/data-roles/> PREFIX owl: <http://www.w3.org/2002/07/owl#> PREFIX prov: <http://www.w3.org/ns/prov#> PREFIX sdo: <https://schema.org/> PREFIX skos: <http://www.w3.org/2004/02/skos/core#> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> <https://data.idnau.org/pid/democat> a dcat:Catalog ; dcterms:title "IDN Demonstration Catalogue" ; dcterms:description """The Indigenous Data Network's catalogue of datasets. This catalogue contains records of datasets in Australia, most of which have some relation to Indigenous Australia. The purpose of this catalogue is not to act as a master catalogue of Indigenous data in Australia to demonstrate improved metadata models and rating systems for data and metadata in order to improve Indigenous data governance. The content of this catalogue conforms to the Indigenous Data Network's Catalogue Profile which is a profile of the DCAT, SKOS and PROV data models."""@en ; dcterms:created "2022-07-31"^^xsd:date ; dcterms:modified "2022-11-09"^^xsd:date ; prov:qualifiedAttribution [ a prov:Attribution; prov:agent <https://linked.data.gov.au/org/idn> ; dcat:hadRole droles:author , droles:owner , droles:custodian ; ] ; dcterms:hasPart <http://example.com/resource/x> ; . <http://example.com/resource/x> a dcat:Dataset ; dcterms:title "Example Dataset 1" ; dcterms:description "This example dataset has a minimalist metadata record that is valid according to the Indigenous Data Network's Catalogue Profile"@en ; dcterms:created "2022-07-31"^^xsd:date ; dcterms:modified "2022-07-31"^^xsd:date ; dcat:theme idnth:indigenous-demographics ; prov:qualifiedAttribution [ a prov:Attribution ; prov:agent <https://orcid.org/0000-0002-8742-7730> ; dcat:hadRole droles:author ; # not being a real dataset, it has no roles other than 'author' for Agents associated with it ] ; . # From the IDN Agents Catalogue <https://linked.data.gov.au/org/idn> a sdo:Organization ; sdo:name "Indigenous Data Network" ; sdo:description "The IDN is within the University of Melbourne. It was established in 2018 to support and coordinate the governance of Indigenous data for Aboriginal and Torres Strait Islander peoples and empower Aboriginal and Torres Strait Islander communities to decide their own local data priorities." ; sdo:url "https://mspgh.unimelb.edu.au/centres-institutes/centre-for-health-equity/research-group/indigenous-data-network"^^xsd:anyURI ; . # From the IDN Agents Catalogue <https://orcid.org/0000-0002-8742-7730> a sdo:Person ; sdo:name "Nicholas J. Car"@en ; sdo:email "nick@kurrawong.ai"^^xsd:anyURI ; sdo:affiliation <https://kurrawong.ai> ; . <https://kurrawong.ai> a sdo:Organization ; sdo:name "KurrawongAI" ; sdo:description "KurrawongAI is a small, Artificial Intelligence, company in Australia specialising in Knowledge Graphs." ; sdo:url "https://kurrawong.ai"^^xsd:anyURI ; . # From the IDN Data Themes Vocabulary idnth:indigenous-demographics a skos:Concept ; skos:prefLabel "Indigenous Demographics"@en ; skos:definition "Concerned with the demographics of Indigenous people in Australia"@en ; skos:inScheme <https://data.idnau.org/pid/vocab/idn-th> ; . # From the IDN Role Codes Vocabulary droles:author a skos:Concept ; skos:prefLabel "Author"@en ; skos:definition "party who authored the resource"@en ; skos:inScheme <https://linked.data.gov.au/def/data-roles> ; . droles:owner a skos:Concept ; skos:prefLabel "Owner"@en ; skos:definition "party that owns the resource"@en ; skos:inScheme <https://linked.data.gov.au/def/data-roles> ; . droles:custodian a skos:Concept ; skos:prefLabel "Custodian"@en ; skos:definition "party that accepts accountability and responsibility for the resource and ensures appropriate care and maintenance of the resource"@en ; skos:inScheme <https://linked.data.gov.au/def/data-roles> ; .

The content for the example above can be accessed online from the IDN Dataset Catalogue's content repository:

5. Scores

B.1. FAIR Model

B.2. Indigeneity Model

B.3. CARE Model

B.4. Local Context Labels Model

B.4.1 Introduction

"The primary objectives of Local Contexts are to enhance and legitimize locally based decision-making and Indigenous governance frameworks for determining ownership, access, and culturally appropriate conditions for sharing historical, contemporary and future collections of cultural heritage and Indigenous data"

About Local Contexts web page

This Local Context model is a small ontology providing just a few classes and properties to indicate how to attach LC labels to objects catalogued according to this profile.

The machine-readable version of this model is available at:

The prefix lc: is used in this section for the namespace of this model: https://data.idnau.org/pid/vocab/lc-labels/.

Figure B.4.1 overviews the model.

Local Context model

Figure B.4.1: Overview of elements of the Local Context Model.

This model requires that a LocalContextLabel, which is a specialised skos:Concept, is assigned to a catalogued dcat:Resource via an intermediate node of type QualifiedLocalContext. This use of the "Qualified Relations" graph modelling pattern allows for the nature of the Resource/LocalContextLabel relationship to be defined.

B.4.2 Definitions

B.4.2 LocalContextLabel class definition

PropertyValue
IRI https://data.idnau.org/pid/vocab/lc-labels/LocalContextLabel
Preferred Label Local Context Label
Definition A label to be applied to data that allow communities to express local and specific conditions for sharing and engaging in future research and relationships in ways that are consistent with already existing community rules, governance and protocols
Source https://localcontexts.org/labels/traditional-knowledge-labels/
Subclass Of SKOS Concept
Expected Properties Standard SKOS Concept properties

B.4.2.2 QualifiedLocalContext class definition

PropertyValue
IRI https://data.idnau.org/pid/vocab/lc-labels/QualifiedLocalContext
Preferred Label Qualified Local Context
Definition An association between an RDF Resource and a Local Context Label that allows for the nature of the relationship to be defined
Provenance Developed for this data model, based on standard Linked Data graph patterns
Subclass Of SKOS Concept
Expected Properties

B.4.2.3 requirement property definition

PropertyValue
IRI https://data.idnau.org/pid/vocab/lc-labels/requirement
Preferred Label requirement
Definition A description of the necessity for a Local Context Label
Provenance Developed for this data model, to cater for labels needed but not present
Domain Qualified Local Context
Range A textual, literal, value

B.4.3 LocalContextLabel instances

LocalContextLabel instances for all defined TK & BC labels are given in two vocabularies:

  1. BC Labels Vocabulary
  2. TK Labels Vocabulary

The vocabularies above, while they have been created by the IDN, are not available publicly.

An example of RDF data for one of these labels is:

PREFIX tk: <https://data.idnau.org/pid/vocab/tk-labels/> tk:women-restricted a skos:Concept , lc:TKLabel ; dcterms:identifier "women-restricted"^^xsd:token ; dcterms:source "https://localcontexts.org/label/tk-women-restricted/"^^xsd:anyURI ; rdfs:isDefinedBy cs: ; skos:definition "This Label should be used when you want to let external users know that the material circulating freely is actually of a highly restricted nature. This is a Women’s Highly Restricted Label and indicates that there are restrictions of access and use based on customary law. This Label can be used to help external users recognize that with this material there are very specific protocols and conditions of use. This Label is designed to recognize that some knowledge is gendered, and that certain knowledge expressions can only be shared among specific members of the community. Only authorized [and/or initiated] women within the community should be using this material."@en ; skos:inScheme cs: ; skos:notation "TK WR" ; skos:prefLabel "Women Restricted"@en ; .

B.4.4 Examples of a Resource with LC labels

Example: A Dataset that does have a label applied to it but requires one <https://www.atsida.edu.au/archive/datasets/au.edu.anu.ada.ddi.20002-aus> a dcat:Dataset; dcterms:title "Annual Aboriginal Census,1921-1944 - Australia"@en ; ... dcat:theme [ a lc:QualifiedLocalContext ; dcterms:type tk:culturally-sensitive ; lc:requirement "This dataset contains information collected in ways no longer thought to be best practice. The data may, if used unwarily, put Indigenous people in an un-due negative light."@en ; ] ; ... .

Example: A Dataset that does have a label applied to it but requires one <http://dx.doi.org/10.26193/V93KUP> a dcat:Dataset; dcterms:title "Aboriginal radio broadcasting in Alice Springs, 1981"@en ; ... dcat:theme [ a lc:QualifiedLocalContext ; dcterms:type tk:multiple-communities ; rdf:value "Responsibility and ownership over this material is spread across several distinct communities. Use will be dependent upon discussion and negotiation with the multiple communities named herein Central Lands Council, Waltja Tjutangku Palyapayi Aboriginal Corporation, Akeyulerre Healing Centre. Decisions about use will need to be decided collectively. As an external user of this material you are asked to recognize and respect cultural protocols in relation to the use of this material and clear your intended use with the relevant communities."@en ; ] ; ... .

Annex A. Mappings

This Annex defines mappings between this Profile and assessment models.

A.1. FAIR Mapping

A.2. Indigeneity Mapping

A.3. CARE Mapping

A.4. Local Context Labels Mapping

References

RFC2119
Bradner, S. Key words for use in RFCs to Indicate Requirement Levels. March 1997. Internet Engineering Task Force. Best Current Practice. URL: https://tools.ietf.org/html/rfc2119
PROF
Rob Atkinson; Nicholas J. Car (eds.). The Profiles Vocabulary. 18 December 2019. W3C Working Group Note. URL: https://www.w3.org/TR/dx-prof/
PROV
Timothy Lebo, Satya Sahoo, Deborah McGuinness (eds.). PROV-O: The PROV Ontology. 30 April 2013. W3C Recommendation. URL: https://www.w3.org/TR/prov-o/
Semantic Web
World Wide Web Consortium. Semantic Web 2015. Web Page. URL: https://www.w3.org/standards/semanticweb/, accessed 2020-06-14
Turtle
World Wide Web Consortium. RDF 1.1 Turtle Terse RDF Triple Language, W3C Recommendation (25 February 2014). URL: https://www.w3.org/standards/semanticweb/