There is a growing need for indexing and harmonizing
retention
time (tR) data in liquid chromatography derived under different conditions
to aid in the identification of compounds in high resolution mass
spectrometry (HRMS) based suspect and nontarget screening of environmental
samples. In this study, a rigorously tested, inexpensive, and simple
system-independent retention index (RI) approach is presented for
liquid chromatography (LC), based on the cocamide diethanolamine homologous
series (C(n = 0–23)-DEA). The validation of
the CDEA based RI system was checked rigorously on eight different
instrumentation and LC conditions. The RI values were modeled using
molecular descriptor free technique based on structural barcoding
and convolutional neural network deep learning. The effect of pH on
the elution pattern of more than 402 emerging contaminants were studied
under diverse LC settings. The uncertainty associated with the CDEA
RI model and the pH effect were addressed and the first RI bank based
on CDEA calibrants was developed. The proposed RI system was used
to enhance identification confidence in suspect and nontarget screening
while facilitating successful comparability of retention index data
between various LC settings. The CDEA RI app can be accessed at https://github.com/raalizadeh/RIdea