Types of Metadata that we Need
How do these thing go together?
Types of Metadata
(intrinsic, descriptive, needed for management)
- Subject metadata (classification, keywords)
- Geospatial metadata (postcode, map references, place names)
- Person-related metadata (people, institutions, co-workers, projects)
- Usage-related metadata (course reading lists, comments, annotations, library borrowing records)
- File Format metadata (definitive for preservation)
- Factual metadata (date, time, user, software system, etc)
- Bibliographic metadata (including citation)
- Multilingual/translated metadata (automatic metadata translation)
- Integrating AMG into deposit workflows (web service orchestration)
- Useage Data (how often, by whom, etc. was this resource
used)
Annotated
Bibliography on Automated Metadata Generation
Natural Language Text
Processing with Python
- Technical and Structural about the Data object itself
- Descriptive, Administrative, Contextual
Metadata Functions ``` John's JISC CETIS blog```
- identify
- find
- select
- use
- cite
- manage
Descriptive Metadata
Supports finding, ordering, and retrieving data. One of the
most common descriptive metadata schemas for digital objects is
Dublin Core which includes the elements Title, Creator,
Subject, Description, Publisher, Contributor, Date, Type,
Format, Identifier, Source, Language, Relation, Coverage and
Rights. There are a variety of other descriptive schemas, often
particular to a specific file type or field of study.
Representational Metadata
Information required to make a digital object readable. For
instance, representational metadata for a MS Word file might
include documentation for MS Word, potentially a copy of MS
Word. We might also need to include information about operating
systems and system requirements for MS Word.
Provenance Metadata
Information documenting how the data was collected and who
was responsible for the collection. Additionally, provenance
metadata should document any changes made to the data, along
with the people and software responsible for the changes.
Often, provenance metadata can be best represented by a well
documented workflow.
Contextual Metadata
Contextual metadata documents the relationship between the
data object and its environment. In the scientific field this
might include the grant proposal related to the data, any
publications arising from the data, an annotated bibliography,
even news articles that bear upon the data. Contextual metadata
can greatly aid others in understanding the purpose and
motivation behind collecting the data, and also document the
impact that the data has had.
Fixity Metadata
THis documents authentication mechanisms and file integrity
checks. A common form of fixity metadata is an MD5 hash on
files. MD5 generates a 32-digit hexadecimal number that will
change if even one bit of the target file changes. You will
often see MD5 hash numbers associated with downloadable files.
This would allow you to verify that the file you downloaded
arrived uncorrupted.
Reference Metadata
Documentation about the generation of unique identifiers for an object. Common reference metadata are NCBI accession numbers and ISBN numbers on books.