Skip to main content
Back to Standards
Journal Article Tag Suite logo

Journal Article Tag Suite

JATS

An XML tag set for marking up journal articles, defined as NISO Z39.96-2024. JATS provides three article models of increasing prescriptiveness: Archiving and Interchange (the most permissive), Journal Publishing (moderately prescriptive), and Article Authoring (the most prescriptive). Originally developed as the NLM Archiving and Interchange DTD beginning in 2002 by NCBI at the National Library of Medicine, it is now the dominant standard for full-text scholarly article encoding used by publishers, repositories, and aggregators worldwide.

Overview

The Journal Article Tag Suite (JATS) is the prevailing XML standard for encoding scholarly journal articles in full text. Defined as NISO Z39.96, it provides a rich, structured vocabulary for marking up every component of a journal article -- from front matter and body text to figures, tables, references, and supplementary materials. JATS underpins the digital infrastructure of scientific publishing, used by major publishers, PubMed Central, and repository systems worldwide.

Background

JATS traces its origins to 2002, when the National Center for Biotechnology Information (NCBI) at the U.S. National Library of Medicine began developing the NLM Archiving and Interchange DTD. The goal was to create a common XML format that could faithfully represent journal articles from diverse publishers for long-term archiving in PubMed Central (PMC). As the DTD gained adoption beyond NLM, the National Information Standards Organization (NISO) took it through formal standardization, publishing it as ANSI/NISO Z39.96 in 2012. The standard has been revised through versions 1.1, 1.2, 1.3, and the current 1.4 (2024).

Purpose and Scope

JATS defines a set of XML elements and attributes for tagging the full content of journal articles. It addresses not only the article text but also structured metadata (authors, affiliations, funding, keywords), mathematical notation (via MathML), chemical structures, tables, figures with captions, and bibliographic references. The standard is designed to support both the original rendering of an article and its long-term preservation and interchange between systems.

JATS provides three article models, each reflecting a different level of prescriptiveness:

  • Journal Archiving and Interchange -- The most permissive tag set, designed to accommodate articles from many publishers with varying structures
  • Journal Publishing -- A moderately prescriptive tag set suited for publishers producing new content
  • Article Authoring -- The most prescriptive, intended for authors creating articles from scratch

Extensions

JATS has spawned related tag suites for adjacent domains:

Extension Purpose
BITS (Book Interchange Tag Suite) XML encoding for books and book components
NISO STS (Standards Tag Suite) XML encoding for standards documents

Serializations and Technical Formats

JATS is an XML application. Articles are encoded as well-formed XML documents conforming to the JATS DTD or XML Schema. The standard defines both DTD and XSD schema representations. JATS documents are typically stored as .xml files and rendered for display through XSLT transformations or specialized publishing platforms.

Governance and Maintenance

JATS is maintained by the NISO Z39.96 standing committee, with the supporting documentation and reference materials hosted by NCBI at the National Library of Medicine. The JATS-Con conference series (held periodically since 2009) brings together publishers, technologists, and standards developers to share experiences and advance the standard.

Notable Implementations

PubMed Central (PMC), the full-text archive of biomedical and life sciences literature at the NLM, is the largest single repository of JATS-encoded content. Major commercial publishers including Elsevier, Springer Nature, Wiley, and Taylor & Francis produce JATS XML as part of their production workflows. The Crossref and CHORUS initiatives use JATS metadata. Open-source tools such as the JATS4R (JATS for Reuse) recommendations guide consistent implementation across the scholarly publishing ecosystem.

Related Standards

  • EPUB -- Electronic publication format that can incorporate JATS-encoded content
  • Dublin Core -- Metadata elements frequently embedded within JATS article headers
  • MathML -- Mathematical markup language used within JATS for equations

Further Reading