141 research outputs found

    Basic Validation Procedures for Regression Models in QSAR and QSPR Studies: Theory and Application

    Get PDF
    Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Four quantitative structure-activity relationships (QSAR) and quantitative structure-property relationship (QSPR) data sets were selected from the literature and used to build regression models with 75, 56, 50 and 15 training samples. The models were validated by leave-one-out crossvalidation, leave-N-out crossvalidation (LNO), external validation, y-randomization and bootstrapping. Validations have shown that the size of the training sets is the crucial factor in determining model performance, which deteriorates as the data set becomes smaller. Models from very small data sets suffer from the impossibility of being thoroughly validated, failure and atypical behavior in certain validations (chance correlation, lack of robustness to resampling and LNO), regardless of their good performance in leave-one-out crossvalidation, fitting and even in external validation. A simple determination of the critical N in LNO has been introduced by using the limit of 0.1 for oscillations in Q(2), quantified as the variation range in single LNO and two standard deviations in multiple LNO. It has been demonstrated that it is sufficient to perform 10-25 y-randomization and bootstrap runs for a typical model validation. The bootstrap schemes based on hierarchical cluster analysis give more reliable and reasonable results than bootstraps relying only on randomization of the complete data set. Data quality in terms of statistical significance of descriptor-y relationships is the second important factor for model performance. Variable selection that does not eliminate insignificant descriptor-y relationships may lead to situations in which they are not detected during model validation, especially when dealing with large data sets.204770787Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP

    The Use of Near-Infrared Spectroscopy and Chemometrics for Determining the Shelf-Life of Products

    Get PDF
    In this work a procedure for determining the shelf-life of products by merging near-infrared (NIR) spectroscopy, multivariate techniques of data analysis, and kinetic theory is presented. This procedure allows information from several sources (sensory, physical chemical, and instrumental) to be merged via the multivariate accelerated shelf-life test (MASLT) algorithm. The MASLT is a multivariate approach that relies on soft modeling via an unfolding principal component analysis (u-PCA) and hard modeling, through the conventional kinetic theory, for determining the final shelf-life of products. The procedure was successfully applied to a consumer goods product (a body lotion), whose shelf-life was determined to be 3 years and 9 months, in accordance with results previously obtained using conventional analytical techniques and univariate methods of data analysis.63111308131

    A Priori Descriptors in QSAR: a Case of Gram-Negative Bacterial Multidrug Resistance to beta-Lactams

    Get PDF
    Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Over hundred new a priori global/local molecular descriptors that encoded steric, topological, electronic, hydrogen bonding. compositional and hydrophobic properties were generated for 16 beta-lactams. and two partial least squares regression models were constructed and cross-validated. These a priori models (Q(2) > 0.90, R(2) > 0.95 SEV < 0.50) are comparable with the previously obtained computed models. beta-lactam intramolecular and beta-lactam-receptor intermolecular interactions are also discussed in terms of molecular descriptors.814579592Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP

    The Violacein Biosynthesis Monitored by Multi-Wavelength Fluorescence Spectroscopy and by the PARAFAC Method

    Get PDF
    Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Obtaining information about a biosynthetic pathway is a complex and laborious procedure. In this sense, this work presents a new approach for the initial analysis of the biosynthesis of fluorescent natural products using as example the violacein biosynthesis. For this, a culture of Chromobacterium violaceum was grown in a bioreactor from which aliquots were collected every 2 h for subsequent analysis by multi-wavelength fluorescence spectroscopy. The excitation-emission matrices demonstrated the dynamic behavior of the fluorophores signal that are consumed and produced by the bacterium. These signals were resolved by PARAFAC (parallel factor analysis) method totalizing six pure components. Tryptophan and violacein were identified by comparison to spectra available in the literature. The identification of other fluorophores was critical step due to the lack of a database of fluorescent natural products to compare spectra. Finally, this methodology has great potential to achieve a deeper insight into the biosynthesis of natural products.231120542064Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq

    Convergence of marine megafauna movement patterns in coastal and open oceans

    Get PDF
    The extent of increasing anthropogenic impacts on large marine vertebrates partly depends on the animals’ movement patterns. Effective conservation requires identification of the key drivers of movement including intrinsic properties and extrinsic constraints associated with the dynamic nature of the environments the animals inhabit. However, the relative importance of intrinsic versus extrinsic factors remains elusive. We analyze a global dataset of ∼2.8 million locations from >2,600 tracked individuals across 50 marine vertebrates evolutionarily separated by millions of years and using different locomotion modes (fly, swim, walk/paddle). Strikingly, movement patterns show a remarkable convergence, being strongly conserved across species and independent of body length and mass, despite these traits ranging over 10 orders of magnitude among the species studied. This represents a fundamental difference between marine and terrestrial vertebrates not previously identified, likely linked to the reduced costs of locomotion in water. Movement patterns were primarily explained by the interaction between species-specific traits and the habitat(s) they move through, resulting in complex movement patterns when moving close to coasts compared with more predictable patterns when moving in open oceans. This distinct difference may be associated with greater complexity within coastal microhabitats, highlighting a critical role of preferred habitat in shaping marine vertebrate global movements. Efforts to develop understanding of the characteristics of vertebrate movement should consider the habitat(s) through which they move to identify how movement patterns will alter with forecasted severe ocean changes, such as reduced Arctic sea ice cover, sea level rise, and declining oxygen content
    corecore