Classic Note Entries

Types of Metadata that we Need

How do these thing go together?

Types of Metadata

(intrinsic, descriptive, needed for management)

  • Subject metadata (classification, keywords)
  • Geospatial metadata (postcode, map references, place names)
  • Person-related metadata (people, institutions, co-workers, projects)
  • Usage-related metadata (course reading lists, comments, annotations, library borrowing records)
  • File Format metadata (definitive for preservation)
  • Factual metadata (date, time, user, software system, etc)
  • Bibliographic metadata (including citation)
  • Multilingual/translated metadata (automatic metadata translation)
  • Integrating AMG into deposit workflows (web service orchestration)
  • Useage Data (how often, by whom, etc. was this resource used)

Annotated Bibliography on Automated Metadata Generation
Natural Language Text Processing with Python

  • Technical and Structural about the Data object itself
  • Descriptive, Administrative, Contextual

Metadata Functions ``` John's JISC CETIS blog```

  • identify
  • find
  • select
  • use
  • cite
  • manage

http://blogs.cetis.ac.uk/johnr/files/2009/08/oermetadata1.jpg

Descriptive Metadata

Supports finding, ordering, and retrieving data. One of the most common descriptive metadata schemas for digital objects is Dublin Core which includes the elements Title, Creator, Subject, Description, Publisher, Contributor, Date, Type, Format, Identifier, Source, Language, Relation, Coverage and Rights. There are a variety of other descriptive schemas, often particular to a specific file type or field of study.

Representational Metadata

Information required to make a digital object readable. For instance, representational metadata for a MS Word file might include documentation for MS Word, potentially a copy of MS Word. We might also need to include information about operating systems and system requirements for MS Word.

Provenance Metadata

Information documenting how the data was collected and who was responsible for the collection. Additionally, provenance metadata should document any changes made to the data, along with the people and software responsible for the changes. Often, provenance metadata can be best represented by a well documented workflow.

Contextual Metadata

Contextual metadata documents the relationship between the data object and its environment. In the scientific field this might include the grant proposal related to the data, any publications arising from the data, an annotated bibliography, even news articles that bear upon the data. Contextual metadata can greatly aid others in understanding the purpose and motivation behind collecting the data, and also document the impact that the data has had.

Fixity Metadata

THis documents authentication mechanisms and file integrity checks. A common form of fixity metadata is an MD5 hash on files. MD5 generates a 32-digit hexadecimal number that will change if even one bit of the target file changes. You will often see MD5 hash numbers associated with downloadable files. This would allow you to verify that the file you downloaded arrived uncorrupted.

Reference Metadata

Documentation about the generation of unique identifiers for an object. Common reference metadata are NCBI accession numbers and ISBN numbers on books.