Skip to main content
Back to Standards
Chemical Entities of Biological Interest logo

Chemical Entities of Biological Interest

ChEBI

A freely accessible database and ontology of molecular entities focused on small chemical compounds of biological interest, maintained by the European Bioinformatics Institute (EMBL-EBI) as part of the Open Biomedical Ontologies effort. ChEBI contains over 195,000 entries covering naturally occurring molecules and synthetic compounds used to intervene in the processes of living organisms, using IUPAC and NC-IUBMB nomenclature. The ontology includes a hierarchical classification enabling queries by chemical class and biological role.

Overview

Chemical Entities of Biological Interest (ChEBI) is a freely available database and ontology of molecular entities maintained by the European Bioinformatics Institute (EMBL-EBI). Covering over 195,000 chemical entities, ChEBI provides a structured classification of small molecules relevant to biology, from metabolites and drugs to environmental chemicals, using internationally standardized nomenclature.

Background

ChEBI was developed at the European Bioinformatics Institute as part of the Open Biomedical Ontologies (OBO) effort, with the first formal description published in 2008. The project was motivated by the need for a freely available, expertly curated chemical ontology that could serve as a reference vocabulary for chemical entities encountered in biological research. Prior to ChEBI, chemical databases were typically either proprietary or lacked the ontological structure needed for computational reasoning about chemical relationships.

ChEBI distinguishes itself from other chemical databases by its focus on biological relevance and its incorporation of an ontological classification. Macromolecules directly encoded by the genome, such as nucleic acids and proteins, are excluded; ChEBI focuses on small chemical compounds and related entities.

Purpose & Scope

ChEBI serves as both a database and an ontology. As a database, it provides detailed information about individual chemical entities including:

  • Systematic and common names following IUPAC and NC-IUBMB nomenclature
  • Molecular formulae, InChI strings, and SMILES representations
  • Cross-references to other databases (PubChem, ChEMBL, DrugBank, KEGG, etc.)
  • Literature citations and species data

As an ontology, ChEBI organizes entities into a classification hierarchy that enables queries based on chemical class (e.g., "amino acid," "carbohydrate") and biological role (e.g., "enzyme inhibitor," "neurotransmitter"). This dual nature makes ChEBI valuable both for record-level chemical identification and for higher-level reasoning about chemical categories.

Key Statistics

Metric Value
Total entries 195,000+
Manually curated Three-star entries (fully curated)
Nomenclature standard IUPAC / NC-IUBMB
Part of OBO Foundry

Serializations & Technical Formats

ChEBI is available in multiple formats: OBO format and OWL for the ontological classification, SDF for chemical structure data, and flat file and SQL dump formats for the full database. Data can also be accessed through web services and a SPARQL endpoint. All downloads are available from the ChEBI website.

Governance & Maintenance

ChEBI is maintained by the ChEBI team at the European Bioinformatics Institute (EMBL-EBI), part of the European Molecular Biology Laboratory. Curation follows a three-tier quality system: one-star entries are automatically generated, two-star entries have been checked by a curator, and three-star entries have been fully manually curated with verified chemical structures and complete annotation. Community submissions are accepted through the ChEBI Submission Portal.

ChEBI is a member of the OBO Foundry, following its principles for open, interoperable ontology development.

Notable Implementations

ChEBI identifiers are widely used across biological databases as the standard reference for small molecule identification. They appear in UniProt, Reactome, MetaboLights, IntEnz, and many pathway and metabolomics databases. ChEBI terms are used in GO annotations to specify chemical participants in molecular functions and biological processes. The ontology is also integrated into the EMBL-EBI Ontology Lookup Service (OLS).

Related Standards

  • Gene Ontology — uses ChEBI terms to specify chemical participants in GO annotations
  • PubChem — complementary chemical database with broader scope including non-biological compounds

Further Reading

Resources & Links