Adaptive matrix metrics for molecular descriptor assessment in QSPR classification

Abstract

QSPR (quantitative structure-property relationship) methods are a useful tool in the drug discovery process because they allow biological or physicochemical properties of a candidate drug to be predicted in advance. To provide reliable predictions, a QSPR method must be as accurate as possible. The selection of molecular descriptors is also an important task for building QSPR prediction models that have low complexity yet still provide accurate predictions. In this work, a matrix-based method is used to transform the original data space of chemical compounds into an alternative space in which compounds with different target properties can be better separated. To apply this approach, QSPR is treated as a classification problem. The advantage of adaptive matrix metrics is twofold: they can be used to identify important molecular descriptors and, at the same time, they allow classification accuracy to be improved. A recently proposed method based on this concept is extended to multi-class data. The new method is related to linear discriminant analysis and gives better results, albeit at higher computational cost. An application relating chemical descriptors to hydrophobicity shows promising results.

Affiliation: Soto, Axel Juan. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Bahía Blanca. Planta Piloto de Ingeniería Química. Universidad Nacional del Sur. Planta Piloto de Ingeniería Química; Argentina

Affiliation: Strickert, Marc. Leibniz Institute of Plant Genetics and Crop Plant Research; Germany

Affiliation: Vazquez, Gustavo Esteban. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Bahía Blanca. Planta Piloto de Ingeniería Química. Universidad Nacional del Sur. Planta Piloto de Ingeniería Química; Argentina
