Skip to main content
Back to Standards
DOI Kernel Metadata Declaration logo

DOI Kernel Metadata Declaration

The DOI Kernel Metadata Declaration is the minimum set of metadata elements required for every Digital Object Identifier (DOI) name, defined by an XML Schema maintained by the International DOI Foundation. Version 2.2 specifies 17 elements across two groups: 14 descriptive elements (DOI name, referent identifiers, referent names, primary referent type, structural type, mode, character, referent type, linked creations, linked parties, principal agent, dates, and associated territory) and 3 administrative elements (registration authority code, issue date, and issue number). The kernel is grounded in the indecs Framework ontology and formalized through ISO 26324.

Overview

The DOI Kernel Metadata Declaration defines the minimum set of metadata that must accompany every Digital Object Identifier (DOI) name. Maintained by the International DOI Foundation (IDF) and formalized as part of ISO 26324, the kernel ensures baseline descriptive and administrative information is always available for any DOI-identified entity, regardless of which Registration Agency issued the identifier or what domain the resource belongs to. Version 2.2 of the kernel XML schema is the current release, with allowed value sets last updated in July 2020.

Background

The Digital Object Identifier system emerged in the late 1990s from the publishing industry's need for persistent identifiers for digital content. The DOI Data Model, which includes the kernel, draws on the indecs (interoperability of data in e-commerce) Framework, an influential metadata interoperability project (1998--2000) that pioneered event-based metadata models for integrating digital transactions across content, author, creator, library, publisher, and rights communities. The DOI system was formalized as ISO 26324 and is governed by the International DOI Foundation, a non-profit organization.

The kernel itself serves two purposes articulated in the DOI Handbook: recognition (making it possible to understand what kind of thing a DOI identifies and to match it with reasonable accuracy to a particular entity) and interoperability (ensuring metadata from different Registration Agencies can be combined or queried by the same software without requiring semantic mapping or transformation).

Purpose & Scope

The DOI Kernel answers fundamental questions about any DOI-identified entity. Its 17 elements are divided into two groups.

Descriptive Elements (Table 4.1 of DOI Handbook)

Element Occurs Description
DOI name 1 The specific DOI name allocated to the referent
referentIdentifier(s) 0-n Other identifiers for the same referent (ISBN, ISSN, ISRC, ISAN, ISNI, etc.)
referentName(s) 0-n Names or titles by which the referent is usually known
primaryReferentType 1 Whether the referent is a creation, party, or event (open list)
structuralType 1 For creations: physical, digital, performance, or abstraction; for parties: person, animal, or organization (closed lists)
mode 0-n Principal sensory modes of perception: audio, visual, tangible, olfactory, tasteable, none (closed list)
character 0-n Fundamental communication form: music, language, image, other (closed list)
referentType 0-n More specific type classifications (open list)
linkedCreation 0-n Associations with other creations, including role relationships
linkedParty 0-n Associations with other parties
principalAgent 0-n Entities principally responsible for creation or publication
dateOfBirthOrFormation 0-1 For parties only
dateOfDeathOrDissolution 0-1 For parties only
associatedTerritory 0-n For parties only, using ISO 3166a2 codes

Administrative Elements (Table 4.2)

Element Occurs Description
registrationAuthorityCode 1 Code identifying the issuing Registration Agency
issueDate 1 Date when the DOI name was issued
issueNumber 0-1 Version designation of the kernel declaration

Controlled Vocabularies and Allowed Value Sets

The kernel uses both closed and open controlled vocabularies. Closed lists (structuralType, mode, character, language via ISO 639-2, territory via ISO 3166a2) cannot be modified by Registration Agencies. Open lists (primaryReferentType, agentRole, creationType, creationIdentifierType, and others) can be extended by any RA through registration in the DOI Data Dictionary.

Version 2.2 expanded several open lists at the request of the EIDR Registration Agency, adding creation types (Film, TVProgramme, WebResource, Season, Clip, and others), new creation-to-creation link roles (Episode, Season, Edit, Performance, Derivation, and others), and additional creation identifier types (Ad-ID, SMPTE-UMID, UUID, URI, and others). It also introduced non-ISO territory codes for defunct territories (DDR, USSR, Yugoslavia) and restructured creation identifiers to support both resolvable URIs and non-resolvable values.

The allowed value sets are maintained in a separate namespace at http://doi.org/10.1000/282, introduced in version 2.2 to make them available to schemas beyond the kernel.

Serialization & Technical Format

The DOI Kernel is expressed as an XML Schema (XSD), maintained by the IDF at http://doi.org/10.1000/276. Registration Agencies may implement the kernel either by producing declarations directly against the kernel XSD or by incorporating kernel elements into their own wider metadata schemas. The DOI Data Dictionary, implemented within the Vocabulary Mapping Framework (VMF), provides the ontological underpinning and supports mapping between kernel terms and those of other metadata standards (ONIX, DDEX, MARC21, Dublin Core, RDA, CIDOC-CRM, FRBR, LOM).

Governance & Maintenance

The IDF governs the kernel through the DOI-RATech group, which coordinates schema updates. Any member of the group may propose changes; routine updates are typically implemented within two weeks. The update procedure involves proposal, consensus review, draft implementation, review period, and release. The DOI Foundation site content is licensed under CC BY 4.0.

ISO 26324 mandates that the data dictionary serve as the repository for all data elements and allowed values, and that all allowed values used in kernel metadata be registered there to facilitate integration of DOI data from different sources.

Notable Implementations

Every DOI ever issued carries kernel metadata. The most prominent Registration Agencies implementing the kernel include:

  • Crossref -- scholarly articles, books, conference proceedings
  • DataCite -- research datasets and other research outputs
  • mEDRA -- European publications
  • EIDR -- entertainment industry identifiers for audiovisual works (the primary driver of version 2.2 changes)
  • KISTI -- Korean scientific and technical information

As of the mid-2020s, more than 300 million DOIs have been registered, each with an associated kernel metadata declaration.

Related Standards

  • indecs Framework -- the semantic interoperability model underlying the DOI Data Model
  • ISO 26324 -- the ISO standard for the DOI system as a whole
  • Vocabulary Mapping Framework (VMF) -- mapping tool supporting semantic interoperability across DOI and non-DOI metadata standards
  • Crossref Metadata Schema -- an RA-specific extension building on DOI metadata
  • DataCite Metadata Schema -- another major RA-level schema built on the DOI foundation

Further Reading