Skip to main content
Back to Standards
Nanopublication logo

Nanopublication

A data model for representing the smallest unit of publishable information as a set of RDF named graphs. Each nanopublication consists of three core components: an assertion graph containing a scientific claim or data statement expressed as RDF triples, a provenance graph recording how the assertion was derived, and a publication information graph providing metadata about the nanopublication itself. Nanopublications are formal, machine-interpretable knowledge graph snippets that can be attributed, cited, and published to a decentralized server network, enabling granular dissemination and reuse of scientific findings.

Overview

Nanopublications represent the smallest unit of publishable information in a machine-readable, attributable, and interoperable format. Built on RDF named graphs, the model enables scientists to publish individual assertions -- single claims, observations, or data relationships -- with full provenance and attribution metadata, supporting granular credit and citation at a level far more precise than traditional journal articles.

Background

The nanopublication concept was proposed by Barend Mons and Jan Velterop around 2009 as a response to the challenge of making scientific data findings individually citable and reusable. Traditional scientific publishing bundles many findings into a single article, making it difficult to reference, attribute, or verify individual claims. Nanopublications decompose this into atomic units of knowledge that can be independently published, cited, and linked.

Purpose and Scope

A nanopublication packages a single piece of knowledge -- for example, a relationship between a gene and a disease, or a measurement result -- in a formal structure that machines can interpret. Each nanopublication has three core components:

  • Assertion: The main content, expressed as an atomic unit of information in RDF triples
  • Provenance: How the assertion came to be, including the scientific methods used, reference to the study type, and its parameters
  • Publication Info: Metadata about the nanopublication itself, such as its creator, creation date, and license terms

Because nanopublications can be attributed and cited, they provide incentives for researchers to make their data available in standard formats, driving data accessibility and interoperability.

Technical Implementation

Nanopublications are implemented in RDF, typically serialized as TriG (which supports named graphs). They can be published to a decentralized server network, then queried, accessed, reused, and linked. The evolving ecosystem includes tools for creation, validation, querying, and browsing of nanopublications.

Governance and Maintenance

The nanopublication community maintains the specification, tooling, and server infrastructure. The model is defined by the nanopublication schema namespace (http://www.nanopub.org/nschema#). Guidelines and documentation are available at nanopub.net.

Notable Implementations

Nanopublications have been adopted in biomedical research, particularly for publishing gene-disease associations, drug-target interactions, and other structured scientific claims. The decentralized server network allows nanopublications to be published without dependence on a central authority.

Related Standards

  • PROV (prov): The provenance component of nanopublications aligns with W3C PROV concepts

Further Reading