Enriching Digital Objects: a Policy Framework
See the definitions of objects and enriching before reading on.
- annotations are research data
- annotations enrich objects by adding or linking to information
- privacy of annotators
- ethical processing of annotation objects
- secure access to annotations
- preservation of annotations
Laws and regulations
- GDPR: see Data protection on the library website for some pointers
The Digitaal Erfgoed Referentiearchitectuur (DERA, the Dutch Digital Heritage Reference Architecture) defines four operational goals to implement its strategic goal. The third is 'Users can add new or external information when necessary.' The reference architecture is aimed at cultural heritage institutions, but this specific goal is clearly connected to our aim: to allow people to enrich digital objects.
Links between objects and enrichments
Define where an object begins and ends, and which types of enrichments you support. Objects may consist of various components, such as high-resolution master images, thumbnails and transcriptions of the text in the object.
Either model an enrichment directly as an annotation, or use an annotation to make a more complex external description of the resource easier to discover. The second approach is useful when an enrichment is more than a single connection between two objects.
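The two modelling options can be sketched as Python dictionaries that follow the JSON-LD serialisation of the Web Annotation Model. This is a minimal illustration, not a prescribed implementation: the `example.org` URIs and the helper names are hypothetical, and the choice of `commenting` and `linking` as motivations is an assumption made for the example.

```python
import json

ANNO_CONTEXT = "http://www.w3.org/ns/anno.jsonld"


def direct_enrichment(anno_id, object_uri, text):
    """Option 1: the annotation itself carries the enrichment as an embedded body."""
    return {
        "@context": ANNO_CONTEXT,
        "id": anno_id,
        "type": "Annotation",
        "motivation": "commenting",  # assumed motivation for this example
        "body": {"type": "TextualBody", "value": text, "format": "text/plain"},
        "target": object_uri,
    }


def pointer_to_description(anno_id, object_uri, description_uri):
    """Option 2: the annotation only links the object to a richer external description."""
    return {
        "@context": ANNO_CONTEXT,
        "id": anno_id,
        "type": "Annotation",
        "motivation": "linking",  # assumed motivation for this example
        "body": description_uri,
        "target": object_uri,
    }


# Hypothetical URIs, for illustration only.
direct = direct_enrichment(
    "https://example.org/annotations/1",
    "https://example.org/objects/42",
    "The binding was replaced in the nineteenth century.",
)
pointer = pointer_to_description(
    "https://example.org/annotations/2",
    "https://example.org/objects/42",
    "https://example.org/descriptions/42-provenance",
)
print(json.dumps(pointer, indent=2))
```

In the second option the external description can be as elaborate as needed, while the annotation stays small and easy to discover.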
Annotations are research data
All policies for research data also apply to annotations, and hence to enrichments as well.
Enrichments should therefore be FAIR (findable, accessible, interoperable and reusable). Use standards for annotations, such as the W3C Web Annotation Model.
Data formats for annotations
- CSV + CSV metadata
- other data formats
  - Hypothes.is JSON
  - brat stand-off format
  - VGG Image Annotator format (JSON or CSV)
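Annotations kept in a tabular format can be mapped to the Web Annotation Model when they need to be exchanged. The sketch below assumes a hypothetical CSV layout with the columns `id`, `target` and `comment`. A real dataset would document its columns in the accompanying CSV metadata, which is not shown here.

```python
import csv
import io

ANNO_CONTEXT = "http://www.w3.org/ns/anno.jsonld"

# Hypothetical sample data; the column names are an assumption for this sketch.
SAMPLE_CSV = """id,target,comment
https://example.org/annotations/1,https://example.org/objects/42,Watermark visible in the lower left corner
https://example.org/annotations/2,https://example.org/objects/43,
"""


def rows_to_annotations(csv_text):
    """Convert each CSV row into a Web Annotation dictionary."""
    annotations = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        annotation = {
            "@context": ANNO_CONTEXT,
            "id": row["id"],
            "type": "Annotation",
            "target": row["target"],
        }
        # An empty comment cell results in an annotation without a body.
        if row["comment"]:
            annotation["body"] = {"type": "TextualBody", "value": row["comment"]}
        annotations.append(annotation)
    return annotations


annotations = rows_to_annotations(SAMPLE_CSV)
```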
Bodies of annotations
Annotations can have one or more bodies, or none (for specific motivations). If there are multiple bodies, they are usually related, but how they are related may vary.
- external, can be anything
- full resources
- fragments of resources
- HTML fragment
- plain text
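Because an annotation may have no body, one body or several, code that reads annotations usually has to normalise the `body` property first. The following is a minimal sketch of that idea. The helper `bodies_of` and all `example.org` URIs are hypothetical, and the use of `highlighting` as a body-less motivation is just one possible case.

```python
ANNO_CONTEXT = "http://www.w3.org/ns/anno.jsonld"


def bodies_of(annotation):
    """Return the bodies of an annotation as a list, whatever form `body` takes."""
    body = annotation.get("body")
    if body is None:
        return []
    if isinstance(body, list):
        return body
    return [body]


# No body: the highlight itself is the whole point of the annotation.
highlight = {
    "@context": ANNO_CONTEXT,
    "id": "https://example.org/annotations/3",
    "type": "Annotation",
    "motivation": "highlighting",
    "target": "https://example.org/objects/42",
}

# Two related bodies: an embedded plain-text tag and an external resource.
tagged = {
    "@context": ANNO_CONTEXT,
    "id": "https://example.org/annotations/4",
    "type": "Annotation",
    "body": [
        {"type": "TextualBody", "value": "watermark", "purpose": "tagging"},
        "https://example.org/descriptions/watermarks/17",
    ],
    "target": "https://example.org/objects/42",
}

print(len(bodies_of(highlight)), len(bodies_of(tagged)))
```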
Example: TEI file as external body
When you transcribe text from manuscripts, you may want to use the TEI guidelines and markup. You can refer to (fragments of) external TEI files as the body of annotations.
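One possible way to point at a fragment of a TEI file is a specific resource with an XPath selector, both of which are defined by the Web Annotation Model. The sketch below makes several assumptions: the URIs are hypothetical, the XPath expression is purely illustrative, and it ignores the TEI namespace, which a real selector would have to take into account.

```python
ANNO_CONTEXT = "http://www.w3.org/ns/anno.jsonld"


def tei_fragment_body(tei_url, xpath):
    """Describe a fragment of an external TEI file as an annotation body."""
    return {
        "type": "SpecificResource",
        "source": tei_url,
        "selector": {"type": "XPathSelector", "value": xpath},
    }


# Hypothetical example: a paragraph of a transcription linked to a page image.
annotation = {
    "@context": ANNO_CONTEXT,
    "id": "https://example.org/annotations/7",
    "type": "Annotation",
    "body": tei_fragment_body(
        "https://example.org/tei/ms-12.xml",
        "/TEI/text/body/div[1]/p[3]",  # illustrative path, namespace ignored
    ),
    "target": "https://example.org/objects/ms-12/folio-3r.jpg",
}
```

To refer to the whole TEI file instead, the body can simply be the URL of the file.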
Example: HTML file as external body
Many documents on the Web are available in HTML format. You can refer to (fragments of) HTML files as the body of annotations.
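For HTML, a fragment is often already addressable through the fragment identifier in the URL. The sketch below splits such a URL into a source and a fragment selector. The helper name `html_body` and the URLs are hypothetical, and using a `FragmentSelector` rather than another selector type is a choice made for this example.

```python
from urllib.parse import urldefrag

# Identifier the Web Annotation Model lists for HTML fragment semantics.
HTML_FRAGMENT_SPEC = "http://tools.ietf.org/rfc/rfc3236"


def html_body(url):
    """Return an annotation body for an HTML document or one of its fragments."""
    source, fragment = urldefrag(url)
    if not fragment:
        # No fragment identifier: the whole document is the body.
        return source
    return {
        "type": "SpecificResource",
        "source": source,
        "selector": {
            "type": "FragmentSelector",
            "conformsTo": HTML_FRAGMENT_SPEC,
            "value": fragment,
        },
    }


whole = html_body("https://example.org/page.html")
part = html_body("https://example.org/page.html#introduction")
```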
Motivation and purpose of annotations
In the Web Annotation Model, motivation and purpose are used to present a reason for the annotation, thereby also explaining the nature of the relationship between the target and the body of the annotation.
The motivations and purposes defined by the Web Annotation Model are useful, but deliberately generic. They can be specialised, though any terms created for special purposes should get persistent identifiers and clear explanations for their use.
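A specialised motivation could be described along the following lines. The sketch assumes the SKOS-based extension pattern, in which a new motivation is a concept with a `skos:broader` link to one of the standard motivations. The `transcribing` term, its URI and its definition are hypothetical and serve only as an illustration.

```python
# The motivations defined by the Web Annotation Model.
STANDARD_MOTIVATIONS = {
    "assessing", "bookmarking", "classifying", "commenting", "describing",
    "editing", "highlighting", "identifying", "linking", "moderating",
    "questioning", "replying", "tagging",
}


def define_motivation(uri, label, definition, broader):
    """Describe a specialised motivation as a JSON-LD dictionary."""
    if broader not in STANDARD_MOTIVATIONS:
        raise ValueError(f"{broader!r} is not a standard motivation")
    return {
        "@context": {
            "oa": "http://www.w3.org/ns/oa#",
            "skos": "http://www.w3.org/2004/02/skos/core#",
        },
        "@id": uri,  # should be a persistent identifier
        "@type": "oa:Motivation",
        "skos:prefLabel": {"@value": label, "@language": "en"},
        "skos:definition": {"@value": definition, "@language": "en"},
        "skos:broader": {"@id": "oa:" + broader},
    }


# Hypothetical term, for illustration only.
transcribing = define_motivation(
    "https://example.org/motivations/transcribing",
    "transcribing",
    "The body is a transcription of text visible in the target.",
    broader="describing",
)
```

The definition covers the clear explanation asked for above. Whether the identifier is actually persistent depends on how the URI is managed, which code alone cannot guarantee.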
Storage and access
- Linked Data
- URIs resolve to descriptions
- access one object at a time
- Linked Data Fragments / Triple Pattern Fragments
- SPARQL endpoint
- RDF dumps in a trusted digital repository (TDR)
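When URIs resolve to descriptions, a client can ask for a machine-readable serialisation through content negotiation. The sketch below only prepares such a request and does not send it. The annotation URI is hypothetical, and it is an assumption that the server honours the JSON-LD media type with the Web Annotation profile.

```python
from urllib.request import Request

# Media type for annotations serialised as JSON-LD with the Web Annotation profile.
ANNO_MEDIA_TYPE = 'application/ld+json; profile="http://www.w3.org/ns/anno.jsonld"'


def description_request(uri):
    """Prepare a request for the JSON-LD description of a single annotation."""
    return Request(uri, headers={"Accept": ANNO_MEDIA_TYPE})


# Hypothetical annotation URI, for illustration only.
request = description_request("https://example.org/annotations/1")
print(request.get_header("Accept"))
```

Passing the prepared request to `urllib.request.urlopen` would fetch one description at a time. For bulk access, the other options in the list above are likely more suitable.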
Informing the object holder
When annotations are not stored by the provider (holder) of the annotated object, the provider can be informed about them. Some providers publish instructions for doing so.
Annotations are RDF graphs and can include authorship information about the external body. What if this information does not match the external body's own metadata? The Web Annotation Model specifies that the external body's metadata is the canonical source.
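A consumer of annotations could apply this rule as sketched below. The function name and the dictionary layout are hypothetical, and how the external body's metadata is obtained is left out of the example.

```python
def reconcile_creator(body_in_annotation, external_metadata):
    """Return (creator, mismatch), preferring the external body's own metadata.

    `mismatch` is True when the annotation records a creator that differs
    from the creator stated in the external body's metadata.
    """
    recorded = body_in_annotation.get("creator")
    canonical = external_metadata.get("creator")
    if canonical is None:
        # Nothing to compare against: fall back to what the annotation says.
        return recorded, False
    mismatch = recorded is not None and recorded != canonical
    return canonical, mismatch


# Hypothetical data, for illustration only.
body = {
    "id": "https://example.org/tei/ms-12.xml",
    "creator": "https://example.org/people/a",
}
metadata = {"creator": "https://example.org/people/b"}

creator, mismatch = reconcile_creator(body, metadata)
```

Flagging the mismatch instead of silently overwriting the value makes it possible to review such cases later.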
Who can add, edit and delete annotations? How do you authenticate and authorise users? These decisions influence the integrity of your research data.
Copyright and content policies
What kind of content may be added in annotations? How will you monitor whether users adhere to this? What do you do in case of violations of copyright laws or content policies? Who decides what is allowed and who makes the final decision in case of a dispute over contents of annotations?
The answers to these questions are perhaps more relevant to your research when you involve citizen science or crowdsourcing and when you cannot be sure that every contributor means well.