Article thumbnail

ARISTO: ontological classification of small molecules by electron ionization-mass spectrometry

By Manor Askenazi and Michal Linial


Gas chromatography–mass spectrometry (GC–MS) acquisitions routinely yield hundreds to thousands of Electron Ionization (EI) mass spectra. The chemical identification of these spectra typically involves a search protocol that seeks an exact match to a reference spectrum. Reference spectra are found in comprehensive libraries of small molecule EI spectra curated by commercial and public entities. We developed ARISTO (Automatic Reduction of Ion Spectra To Ontology), a webtool, which provides information regarding the general chemical nature of the compound underlying an input EI mass spectrum. Importantly, ARISTO can provide such annotation without necessitating an exact match to a specific compound. ARISTO provides assignments to a subset of the ChEBI (Chemical Entities of Biological Interest) dictionary, an ontology, which aims to cover biologically relevant small molecules. Our system takes as input a mass spectrum represented as a series of mass and intensity pairs; the system returns a graphical representation of the supported ontology as well as a detailed table of suggested annotations along with their associated statistical evidence. ARISTO is accessible at this URL: The system is free, open to all and does not require registration of any sort

Topics: Articles
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (1999). An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data.
  2. (2009). Changing the face of scientific publishing.
  3. (2009). Chemical entities of biological interest: an update.
  4. (1995). Chemical substructure identification by mass spectral library searching.
  5. (2000). Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.
  6. (2001). Library search of mass spectra with a new matching algorithm based on substructure similarity.
  7. (2010). MassBank: a public repository for sharing mass spectral data for life sciences.
  8. (1994). Optimization and testing of mass spectral library search algorithms for compound identification.
  9. (1993). Potential bile acid metabolites. 20. A new synthetic route to stereoisomeric 3,6-dihydroxy- and 6-hydroxy-5 alpha-cholanoic acids.
  10. (1999). The critical evaluation of a comprehensive mass spectral library.
  11. (2010). The status of the InChI project and the InChI trust.