248 research outputs found

    Review of QSAR Models and Software Tools for Predicting of Genotoxicity and Carcinogenicity

    Get PDF
    This review of QSARs for genotoxicity and carcinogenicity was performed in a broad sense, considering both models available in software tools and models that are published in the literature. The review considered the potential applicability of diverse models to pesticides as well as to other types of regulated chemicals and pharmaceuticals. The availability of models and information on their applicability is summarised in tables, and a range of illustrative or informative examples are described in more detail in the text. In many cases, promising models were identified but they are still at the research stage. For routine application in a regulatory setting, further efforts will be needed to explore the applicability of such models for specific purposes, and to implement them in a practically useful form (i.e. user-friendly software). It is also noted that a range of software tools are research tools suitable for model development, and these require more specialised expertise than other tools that are aimed primarily at end-users such as risk assessors. It is concluded that the most useful models are those which are implemented in software tools and associated with transparent documentation on the model development and validation process. However, it is emphasised that the assessment of model predictions requires a reasonable amount of QSAR knowledge, even if it is not necessary to be a QSAR practitioner.JRC.DG.I.6-Systems toxicolog

    ARTIFICIAL NEURAL NETWORKS: FUNCTIONINGANDAPPLICATIONS IN PHARMACEUTICAL INDUSTRY

    Get PDF
    Artificial Neural Network (ANN) technology is a group of computer designed algorithms for simulating neurological processing to process information and produce outcomes like the thinking process of humans in learning, decision making and solving problems. The uniqueness of ANN is its ability to deliver desirable results even with the help of incomplete or historical data results without a need for structured experimental design by modeling and pattern recognition. It imbibes data through repetition with suitable learning models, similarly to humans, without actual programming. It leverages its ability by processing elements connected with the user given inputs which transfers as a function and provides as output. Moreover, the present output by ANN is a combinational effect of data collected from previous inputs and the current responsiveness of the system. Technically, ANN is associated with highly monitored network along with a back propagation learning standard. Due to its exceptional predictability, the current uses of ANN can be applied to many more disciplines in the area of science which requires multivariate data analysis. In the pharmaceutical process, this flexible tool is used to simulate various non-linear relationships. It also finds its application in the enhancement of pre-formulation parameters for predicting physicochemical properties of drug substances. It also finds its applications in pharmaceutical research, medicinal chemistry, QSAR study, pharmaceutical instrumental engineering. Its multi-objective concurrent optimization is adopted in the drug discovery process, protein structure, rational data analysis also

    The Use of Computational Methods in the Toxicological Assessment of Chemicals in Food: Current Status and Future Prospects

    Get PDF
    A wide range of chemicals are intentionally added to, or unintentially found in, food products, often in very small amounts. Depending on the situation, the experimental data needed to complete a dietary risk assessment, which is the scientific basis for protecting human health, may not be available or obtainable, for reasons of cost, time and animal welfare. For example, toxicity data are often lacking for the metabolites and degradation products of pesticide active ingredients. There is therefore an interest in the development and application of efficient and effective non-animal methods for assessing chemical toxicity, including Quantitative Structure-Activity Relationship (QSAR) models and related computational methods. This report gives an overview of how computational methods are currently used in the field of food safety by national regulatory bodies, international advisory organisations and the food industry. On the basis of an international survey, a comprehensive literature review and a detailed QSAR analysis, a range of recommendations are made with the long-term aim of promoting the judicious use of suitable QSAR methods. The current status of QSAR methods is reviewed not only for toxicological endpoints relevant to dietary risk assessment, but also for Absorption, Distribution, Metabolism and Excretion (ADME) properties, which are often important in discriminating between the toxicological profiles of parent compounds and their reaction products. By referring to the concept of the Threshold of Toxicological Concern (TTC), the risk assessment context in which QSAR methods can be expected to be used is also discussed. This Joint Research Centre (JRC) Reference Report provides a summary and update of the findings obtained in a study carried out by the JRC under the terms of a contract awarded by the European Food Safety Authority (EFSA).JRC.DG.I.6-Systems toxicolog

    Inductive queries for a drug designing robot scientist

    Get PDF
    It is increasingly clear that machine learning algorithms need to be integrated in an iterative scientific discovery loop, in which data is queried repeatedly by means of inductive queries and where the computer provides guidance to the experiments that are being performed. In this chapter, we summarise several key challenges in achieving this integration of machine learning and data mining algorithms in methods for the discovery of Quantitative Structure Activity Relationships (QSARs). We introduce the concept of a robot scientist, in which all steps of the discovery process are automated; we discuss the representation of molecular data such that knowledge discovery tools can analyse it, and we discuss the adaptation of machine learning and data mining algorithms to guide QSAR experiments

    Meta-QSAR: a large-scale application of meta-learning to drug design and discovery.

    Get PDF
    We investigate the learning of quantitative structure activity relationships (QSARs) as a case-study of meta-learning. This application area is of the highest societal importance, as it is a key step in the development of new medicines. The standard QSAR learning problem is: given a target (usually a protein) and a set of chemical compounds (small molecules) with associated bioactivities (e.g. inhibition of the target), learn a predictive mapping from molecular representation to activity. Although almost every type of machine learning method has been applied to QSAR learning there is no agreed single best way of learning QSARs, and therefore the problem area is well-suited to meta-learning. We first carried out the most comprehensive ever comparison of machine learning methods for QSAR learning: 18 regression methods, 3 molecular representations, applied to more than 2700 QSAR problems. (These results have been made publicly available on OpenML and represent a valuable resource for testing novel meta-learning methods.) We then investigated the utility of algorithm selection for QSAR problems. We found that this meta-learning approach outperformed the best individual QSAR learning method (random forests using a molecular fingerprint representation) by up to 13%, on average. We conclude that meta-learning outperforms base-learning methods for QSAR learning, and as this investigation is one of the most extensive ever comparisons of base and meta-learning methods ever made, it provides evidence for the general effectiveness of meta-learning over base-learning

    Fast conditional density estimation for quantitative structure-activity relationships

    Get PDF
    Many methods for quantitative structure-activity relationships (QSARs) deliver point estimates only, without quantifying the uncertainty inherent in the prediction. One way to quantify the uncertainy of a QSAR prediction is to predict the conditional density of the activity given the structure instead of a point estimate. If a conditional density estimate is available, it is easy to derive prediction intervals of activities. In this paper, we experimentally evaluate and compare three methods for conditional density estimation for their suitability in QSAR modeling. In contrast to traditional methods for conditional density estimation, they are based on generic machine learning schemes, more specifically, class probability estimators. Our experiments show that a kernel estimator based on class probability estimates from a random forest classifier is highly competitive with Gaussian process regression, while taking only a fraction of the time for training. Therefore, generic machine-learning based methods for conditional density estimation may be a good and fast option for quantifying uncertainty in QSAR modeling.http://www.aaai.org/ocs/index.php/AAAI/AAAI10/paper/view/181

    Collection and Evaluation of (Q)SAR Models for Mutagenicity and Carcinogenicity

    Get PDF
    This evaluation of the non-commercial (Q)SARs for mutagenicity and carcinogenicity consisted of a preliminary survey (Phase I), and then of a more detailed analysis of short listed models (Phase II). In Phase I, the models were collected from the literature, and then assessed according to the OECD principles based on the information provided by the authors-. Phase I provided the support for short listing a number of promising models, that were analyzed more in depth in Phase II. In Phase II, the information provided by the authors was completed and complemented with a series of analyses aimed at generating an overall profile of each of the short listed models. The models can be divided into two families based on their target: a) congeneric; and b) non-congeneric sets of chemicals. The QSARs for congeneric chemicals include most of the chemical classes top ranking in the EU High Production Volume list, with the notable exception of the halogenated aliphatics. They almost exclusively aim at modeling Salmonella mutagenicity and rodent carcinogenicity, which are crucial toxicological endpoints in the regulatory context. The lack of models for in vivo genotoxicity should be remarked. Overall the short listed models can be interpreted mechanistically, and agree with, and/or support the available scientific knowledge, and most of the models have good statistics. Based on external prediction tests, the QSARs for the potency of congeneric chemicals are 30 to 70 % correct, whereas the models for discriminating between active and inactive chemicals have considerably higher accuracy (63 to 100 %), thus indicating that predicting intervals is more reliable than predicting individual data points. The internal validation procedures (e.g., cross-validation, etc...) did not seem to be a reliable measure of external predictivity. Among the non-local, or global approaches for non-congeneric data sets, four models based on the use of Structural Alerts (SA) were short listed and investigated in more depth. The four sets did not differ to a large extent in their performance. In the general databases of chemicals the SAs appear to agree around 65% with rodent carcinogenicity data, and 75% with Salmonella mutagenicity data. The SAs based models do not seem to work equally efficiently in the discrimination between active and inactive chemicals within individual chemical classes. Thus, their main role is that of preliminary, or large-scale screenings. A priority for future research on the SAs is their expansion to include alerts for nongenotoxic carcinogens. A general indication of this study, valid for both congeneric and noncongeneric models, is that there is uncertainty associated with (Q)SARs; the level of uncertainty has to be considered when using (Q)SAR in a regulatory context. However, (Q)SARs are not meant to be black-box machines for predictions, but have a much larger scope including organization and rationalization of data, contribution to highlight mechanisms of action, complementation of other data from different sources (e.g., experiments). Using only non-testing methods, the larger the evidence from QSARs (several different models, if available) and other approaches (e.g. chemical categories, read across) the higher the confidence in the prediction.JRC.I.3-Toxicology and chemical substance

    Evolutionary Computation and QSAR Research

    Get PDF
    [Abstract] The successful high throughput screening of molecule libraries for a specific biological property is one of the main improvements in drug discovery. The virtual molecular filtering and screening relies greatly on quantitative structure-activity relationship (QSAR) analysis, a mathematical model that correlates the activity of a molecule with molecular descriptors. QSAR models have the potential to reduce the costly failure of drug candidates in advanced (clinical) stages by filtering combinatorial libraries, eliminating candidates with a predicted toxic effect and poor pharmacokinetic profiles, and reducing the number of experiments. To obtain a predictive and reliable QSAR model, scientists use methods from various fields such as molecular modeling, pattern recognition, machine learning or artificial intelligence. QSAR modeling relies on three main steps: molecular structure codification into molecular descriptors, selection of relevant variables in the context of the analyzed activity, and search of the optimal mathematical model that correlates the molecular descriptors with a specific activity. Since a variety of techniques from statistics and artificial intelligence can aid variable selection and model building steps, this review focuses on the evolutionary computation methods supporting these tasks. Thus, this review explains the basic of the genetic algorithms and genetic programming as evolutionary computation approaches, the selection methods for high-dimensional data in QSAR, the methods to build QSAR models, the current evolutionary feature selection methods and applications in QSAR and the future trend on the joint or multi-task feature selection methods.Instituto de Salud Carlos III, PIO52048Instituto de Salud Carlos III, RD07/0067/0005Ministerio de Industria, Comercio y Turismo; TSI-020110-2009-53)Galicia. ConsellerĂ­a de EconomĂ­a e Industria; 10SIN105004P

    Density Functional Theory in the Prediction of Mutagenicity: A Perspective

    Get PDF
    As a field, computational toxicology is concerned with using in silico models to predict and understand the origins of toxicity. It is fast, relatively inexpensive, and avoids the ethical conundrum of using animals in scientific experimentation. In this perspective, we discuss the importance of computational models in toxicology, with a specific focus on the different model types that can be used in predictive toxicological approaches toward mutagenicity (SARs and QSARs). We then focus on how quantum chemical methods, such as density functional theory (DFT), have previously been used in the prediction of mutagenicity. It is then discussed how DFT allows for the development of new chemical descriptors that focus on capturing the steric and energetic effects that influence toxicological reactions. We hope to demonstrate the role that DFT plays in understanding the fundamental, intrinsic chemistry of toxicological reactions in predictive toxicology
    • 

    corecore