Skip to main content
Back to Standards
IPTC NewsCodes logo

IPTC NewsCodes

A comprehensive collection of controlled vocabularies maintained by the International Press Telecommunications Council (IPTC) for the news industry. IPTC NewsCodes encompass over fifty taxonomies covering subjects (Media Topics), genres, content roles, video codecs, and other dimensions of news content classification. The vocabularies are language-agnostic by design, using codes with multilingual definitions, and are available in machine-readable formats including RDF/XML, JSON-LD, and SKOS. They are freely licensed under CC-BY 4.0.

Overview

IPTC NewsCodes are a comprehensive suite of controlled vocabularies maintained by the International Press Telecommunications Council for classifying and describing news content. Comprising over fifty individual taxonomies, they provide a standardized, language-agnostic system of codes that news organizations worldwide use to annotate text, photographs, graphics, audio, and video.

Background

The International Press Telecommunications Council has been developing technical standards for the news industry since 1965. The NewsCodes system evolved from earlier IPTC classification schemes, notably the IPTC Subject Codes, which provided hierarchical subject classification for news content. As the needs of the news industry grew more complex — encompassing not just subject matter but genres, content roles, video codecs, and dozens of other dimensions — IPTC expanded its approach from a single taxonomy to a collection of purpose-specific controlled vocabularies grouped under the NewsCodes umbrella.

Since 2010, IPTC has recommended the Media Topics vocabulary as the primary subject classification scheme, replacing the older Subject Codes. Mappings between the legacy Subject Codes and the newer Media Topics are provided for migration purposes.

Purpose & Scope

NewsCodes serve a fundamental need in the news industry: the ability to apply consistent, machine-processable metadata to news items that can be shared across organizations and over time. Unlike free-text descriptions, codes are unambiguous and language-neutral — the same code carries identical meaning regardless of the language of the content it describes. Each code has an explicit, comprehensive definition, and definitions are translated into multiple languages to support international use.

The vocabularies cover a wide range of classification dimensions, including but not limited to subject matter (Media Topics), genre, content warnings, scene types, content production roles, and technical format descriptions such as video codecs.

Key Vocabularies

Vocabulary Purpose
Media Topics Subject classification (recommended since 2010)
Subject Codes Legacy subject classification (predecessor to Media Topics)
Genre Content genre classification
Scene Scene type for visual content
Content Warning Warnings about sensitive content
Digital Source Type How digital content was created or obtained

Technical Access

All IPTC NewsCodes are available through the IPTC Controlled Vocabulary server at http://cv.iptc.org/newscodes/ in machine-readable formats suitable for automated ingestion into content management systems. Formats include RDF/XML, JSON-LD, and SKOS representations. An interactive tree view is available for browsing the Media Topics vocabulary.

Licensing

All IPTC NewsCodes are licensed under the Creative Commons Attribution 4.0 (CC-BY 4.0) license. They may be used at any stage of a news workflow without royalty fees, with the requirement that IPTC is credited.

Governance & Maintenance

NewsCodes are maintained by the IPTC, with individual vocabularies versioned and updated independently. The IPTC welcomes suggestions for new topics and vocabulary modifications through its guidelines process. Changes are published to the controlled vocabulary server.

Notable Implementations

IPTC NewsCodes are used by the world's largest news agencies, including Agence France-Presse, Associated Press, and Reuters, primarily through their NewsML-G2 content packages. The Media Topics vocabulary has gained particular traction in Scandinavia, where the Swedish news agency TT and the Norwegian news agency NTB use it for content sharing. Ritzau in Denmark and ABC Australia use Media Topics as the core of their extended vocabularies. Automated classification tools from vendors such as iMatrics, Microsoft Azure Media Services, MeaningCloud, and TextRazor support IPTC Media Topics natively.

Related Standards

  • IPTC Photo Metadata — the IPTC standard for embedded image metadata, which references NewsCodes vocabularies
  • NewsML-G2 — the IPTC XML news exchange format that uses NewsCodes as its controlled vocabulary layer
  • rNews — the IPTC RDFa vocabulary for annotating news on the web, referencing NewsCodes concepts

Further Reading