813 research outputs found

    Intelligent Modelling of the Environmental Behaviour of Chemicals

    Get PDF
    In view of the new European Union chemical policy REACH (Registration, Evaluation, and Authorization of Chemicals), interest in "non-animal" methods for assessing the risk potentials of chemicals towards human health and environment has increased. The incapability of classical modelling approaches in the complex and ill-defined modelling problems of chemicals' environmental behavior, together with an availability of large computing power in modern times raise an interest in applying computational models inspired by the approaches coming from the area of artificial intelligence. This thesis is devoted to promote the applications of neuro/fuzzy techniques in assessing the environmental behavior of chemicals. Some of the bottlenecks lying in the neuro/fuzzy modelling of chemicals' behavior towards environment have been identified and the solutions have been provided based on the techniques of computational intelligence.Diese Dissertation beinhaltet die Anwendung von neuronalen bzw. fuzzy Netzen, um das Umweltverhalten von Chemikalien beurteilen zu können. In dieser Arbeit werden die Probleme der Modellierung von Chemikalien gegenüber der Umwelt aufgezeigt und Lösungen angeboten. Die Lösungen basieren auf künstlichen Intelligenztechniken. Die Qualität der Modellierungstechniken hängt von mehreren Faktoren ab, z.B. der Eingabe, der Struktur und so weiter. In vielen Fällen werden keine geeigneten Resultate erhalten. So läuft es auf die Entwicklung eines Modells mit einer niedrigen Generalisierungsfähigkeit (Verallgemeinerungsfähigkeit)hinaus

    Preface

    Get PDF

    On the development of decision-making systems based on fuzzy models to assess water quality in rivers

    Get PDF
    There are many situations where a linguistic description of complex phenomena allows better assessments. It is well known that the assessment of water quality continues depending heavily upon subjective judgments and interpretation, despite the huge datasets available nowadays. In that sense, the aim of this study has been to introduce intelligent linguistic operations to analyze databases, and produce self interpretable water quality indicators, which tolerate both imprecision and linguistic uncertainty. Such imprecision typically reflects the ambiguity of human thinking when perceptions need to be expressed. Environmental management concepts such as: "water quality", "level of risk", or "ecological status" are ideally dealt with linguistic variables. In the present Thesis, the flexibility of computing with words offered by fuzzy logic has been considered in these management issues. Firstly, a multipurpose hierarchical water quality index has been designed with fuzzy reasoning. It integrates a wide set of indicators including: organic pollution, nutrients, pathogens, physicochemical macro-variables, and priority micro-contaminants. Likewise, the relative importance of the water quality indicators has been dealt with the analytic hierarchy process, a decision-aiding method. Secondly, a methodology based on a hybrid approach that combines fuzzy inference systems and artificial neural networks has been used to classify ecological status in surface waters according to the Water Framework Directive. This methodology has allowed dealing efficiently with the non-linearity and subjective nature of variables involved in this classification problem. The complexity of inference systems, the appropriate choice of linguistic rules, and the influence of the functions that transform numerical variables into linguistic variables have been studied. Thirdly, a concurrent neuro-fuzzy model based on screening ecological risk assessment has been developed. It has considered the presence of hazardous substances in rivers, and incorporates an innovative ranking and scoring system, based on a self-organizing map, to account for the likely ecological hazards posed by the presence of chemical substances in freshwater ecosystems. Hazard factors are combined with environmental concentrations within fuzzy inference systems to compute ecological risk potentials under linguistic uncertainty. The estimation of ecological risk potentials allows identifying those substances requiring stricter controls and further rigorous risk assessment. Likewise, the aggregation of ecological risk potentials, by means of empirical cumulative distribution functions, has allowed estimating changes in water quality over time. The neuro-fuzzy approach has been validated by comparison with biological monitoring. Finally, a hierarchical fuzzy inference system to deal with sediment based ecological risk assessment has been designed. The study was centered in sediments, since they produce complementary findings to water quality analysis, especially when temporal trends are required. Results from chemical and eco-toxicological analyses have been used as inputs to two parallel inference systems which assess levels of contamination and toxicity, respectively. Results from both inference engines are then treated in a third inference engine which provides a final risk characterization, where the risk is provided in linguistic terms, with their respective degrees of certitude. Inputs to the risk system have been the levels of potentially toxic substances, mainly metals and chlorinated organic compounds, and the toxicity measured with a screening test which uses the photo-luminescent bacteria Vibrio fischeri. The Ebro river basin has been selected as case study, although the methodologies here explained can easily be applied to other rivers. In conclusion, this study has broadly demonstrated that the design of water quality indexes, based on fuzzy logic, emerges as suitable and alternative tool to support decision makers involved in effective sustainable river basin management plans.Existen diversas situaciones en las cuales la descripción en términos lingüísticos de fenómenos complejos permite mejores resultados. A pesar de los volúmenes de información cuantitativa que se manejan actualmente, es bien sabido que la gestión de la calidad del agua todavía obedece a juicios subjetivos y de interpretación de los expertos. Por tanto, el reto en este trabajo ha sido la introducción de operaciones lógicas que computen con palabras durante el análisis de los datos, para la elaboración de indicadores auto-interpretables de calidad del agua, que toleren la imprecisión e incertidumbre lingüística. Esta imprecisión típicamente refleja la ambigüedad del pensamiento humano para expresar percepciones. De allí que las variables lingüísticas se presenten como muy atractivas para el manejo de conceptos de la gestión medioambiental, como es el caso de la "calidad del agua", el "nivel de riesgo" o el "estado ecológico". Por tanto, en la presente Tesis, la flexibilidad de la lógica difusa para computar con palabras se ha adaptado a diversos tópicos en la gestión de la calidad del agua. Primero, se desarrolló un índice jerárquico multipropósito de calidad del agua que se obtuvo mediante razonamiento difuso. El índice integra un extenso grupo de indicadores que incluyen: contaminación orgánica, nutrientes, patógenos, variables macroscópicas, así como sustancias prioritarias micro-contaminantes. La importancia relativa de los indicadores al interior del sistema de inferencia se estimó con un método de análisis de decisiones, llamado proceso jerárquico analítico. En una segunda fase, se utilizó una metodología híbrida que combina los sistemas de inferencia difusos y las redes neuronales artificiales, conocida como neuro-fuzzy, para el estudio de la clasificación del estado ecológico de los ríos, de acuerdo con los lineamientos de la Directiva Marco de Aguas. Esta metodología permitió un manejo adecuado de la no-linealidad y naturaleza subjetiva de las variables involucradas en este problema clasificatorio. Con ella, se estudió la complejidad de los sistemas de inferencia, la selección apropiada de reglas lingüísticas y la influencia de las funciones que transforman las variables numéricas en lingüísticas. En una tercera fase, se desarrolló un modelo conceptual neuro-fuzzy concurrente basado en la metodología de evaluación de riesgo ecológico preliminar. Este modelo consideró la presencia de sustancias peligrosas en los ríos, e incorporó un mapa auto-organizativo para clasificar las sustancias químicas, en términos de su peligrosidad hacia los ecosistemas acuáticos. Con este modelo se estimaron potenciales de riesgo ecológico por combinación de factores de peligrosidad y de concentraciones de las sustancias químicas en el agua. Debido a la alta imprecisión e incertidumbre lingüística, estos potenciales se obtuvieron mediante sistemas de inferencia difusos, y se integraron por medio de distribuciones empíricas acumuladas, con las cuales se pueden analizar cambios espacio-temporales en la calidad del agua. Finalmente, se diseñó un sistema jerárquico de inferencia difuso para la evaluación del riesgo ecológico en sedimentos de ribera. Este sistema estima los grados de contaminación, toxicidad y riesgo en los sedimentos en términos lingüísticos, con sus respectivos niveles de certeza. El sistema se alimenta con información proveniente de análisis químicos, que detectan la presencia de sustancias micro-contaminantes, y de ensayos eco-toxicológicos tipo "screening" que usan la bacteria Vibrio fischeri. Como caso de estudio se seleccionó la cuenca del río Ebro, aunque las metodologías aquí desarrolladas pueden aplicarse fácilmente a otros ríos. En conclusión, este trabajo demuestra ampliamente que el diseño y aplicación de indicadores de calidad de las aguas, basados en la metodología de la lógica difusa, constituyen una herramienta sencilla y útil para los tomadores de decisiones encargados de la gestión sostenible de las cuencas hidrográficas

    ADAPTS: An Intelligent Sustainable Conceptual Framework for Engineering Projects

    Get PDF
    This paper presents a conceptual framework for the optimization of environmental sustainability in engineering projects, both for products and industrial facilities or processes. The main objective of this work is to propose a conceptual framework to help researchers to approach optimization under the criteria of sustainability of engineering projects, making use of current Machine Learning techniques. For the development of this conceptual framework, a bibliographic search has been carried out on the Web of Science. From the selected documents and through a hermeneutic procedure the texts have been analyzed and the conceptual framework has been carried out. A graphic representation pyramid shape is shown to clearly define the variables of the proposed conceptual framework and their relationships. The conceptual framework consists of 5 dimensions; its acronym is ADAPTS. In the base are: (1) the Application to which it is intended, (2) the available DAta, (3) the APproach under which it is operated, and (4) the machine learning Tool used. At the top of the pyramid, (5) the necessary Sensing. A study case is proposed to show its applicability. This work is part of a broader line of research, in terms of optimization under sustainability criteria.Telefónica Chair “Intelligence in Networks” of the University of Seville (Spain

    Soft computing approaches to uncertainty propagation in environmental risk mangement

    Get PDF
    Real-world problems, especially those that involve natural systems, are complex and composed of many nondeterministic components having non-linear coupling. It turns out that in dealing with such systems, one has to face a high degree of uncertainty and tolerate imprecision. Classical system models based on numerical analysis, crisp logic or binary logic have characteristics of precision and categoricity and classified as hard computing approach. In contrast soft computing approaches like probabilistic reasoning, fuzzy logic, artificial neural nets etc have characteristics of approximation and dispositionality. Although in hard computing, imprecision and uncertainty are undesirable properties, in soft computing the tolerance for imprecision and uncertainty is exploited to achieve tractability, lower cost of computation, effective communication and high Machine Intelligence Quotient (MIQ). Proposed thesis has tried to explore use of different soft computing approaches to handle uncertainty in environmental risk management. The work has been divided into three parts consisting five papers. In the first part of this thesis different uncertainty propagation methods have been investigated. The first methodology is generalized fuzzy α-cut based on the concept of transformation method. A case study of uncertainty analysis of pollutant transport in the subsurface has been used to show the utility of this approach. This approach shows superiority over conventional methods of uncertainty modelling. A Second method is proposed to manage uncertainty and variability together in risk models. The new hybrid approach combining probabilistic and fuzzy set theory is called Fuzzy Latin Hypercube Sampling (FLHS). An important property of this method is its ability to separate randomness and imprecision to increase the quality of information. A fuzzified statistical summary of the model results gives indices of sensitivity and uncertainty that relate the effects of variability and uncertainty of input variables to model predictions. The feasibility of the method is validated to analyze total variance in the calculation of incremental lifetime risks due to polychlorinated dibenzo-p-dioxins and dibenzofurans (PCDD/F) for the residents living in the surroundings of a municipal solid waste incinerator (MSWI) in Basque Country, Spain. The second part of this thesis deals with the use of artificial intelligence technique for generating environmental indices. The first paper focused on the development of a Hazzard Index (HI) using persistence, bioaccumulation and toxicity properties of a large number of organic and inorganic pollutants. For deriving this index, Self-Organizing Maps (SOM) has been used which provided a hazard ranking for each compound. Subsequently, an Integral Risk Index was developed taking into account the HI and the concentrations of all pollutants in soil samples collected in the target area. Finally, a risk map was elaborated by representing the spatial distribution of the Integral Risk Index with a Geographic Information System (GIS). The second paper is an improvement of the first work. New approach called Neuro-Probabilistic HI was developed by combining SOM and Monte-Carlo analysis. It considers uncertainty associated with contaminants characteristic values. This new index seems to be an adequate tool to be taken into account in risk assessment processes. In both study, the methods have been validated through its implementation in the industrial chemical / petrochemical area of Tarragona. The third part of this thesis deals with decision-making framework for environmental risk management. In this study, an integrated fuzzy relation analysis (IFRA) model is proposed for risk assessment involving multiple criteria. The fuzzy risk-analysis model is proposed to comprehensively evaluate all risks associated with contaminated systems resulting from more than one toxic chemical. The model is an integrated view on uncertainty techniques based on multi-valued mappings, fuzzy relations and fuzzy analytical hierarchical process. Integration of system simulation and risk analysis using fuzzy approach allowed to incorporate system modelling uncertainty and subjective risk criteria. In this study, it has been shown that a broad integration of fuzzy system simulation and fuzzy risk analysis is possible. In conclusion, this study has broadly demonstrated the usefulness of soft computing approaches in environmental risk analysis. The proposed methods could significantly advance practice of risk analysis by effectively addressing critical issues of uncertainty propagation problem.Los problemas del mundo real, especialmente aquellos que implican sistemas naturales, son complejos y se componen de muchos componentes indeterminados, que muestran en muchos casos una relación no lineal. Los modelos convencionales basados en técnicas analíticas que se utilizan actualmente para conocer y predecir el comportamiento de dichos sistemas pueden ser muy complicados e inflexibles cuando se quiere hacer frente a la imprecisión y la complejidad del sistema en un mundo real. El tratamiento de dichos sistemas, supone el enfrentarse a un elevado nivel de incertidumbre así como considerar la imprecisión. Los modelos clásicos basados en análisis numéricos, lógica de valores exactos o binarios, se caracterizan por su precisión y categorización y son clasificados como una aproximación al hard computing. Por el contrario, el soft computing tal como la lógica de razonamiento probabilístico, las redes neuronales artificiales, etc., tienen la característica de aproximación y disponibilidad. Aunque en la hard computing, la imprecisión y la incertidumbre son propiedades no deseadas, en el soft computing la tolerancia en la imprecisión y la incerteza se aprovechan para alcanzar tratabilidad, bajos costes de computación, una comunicación efectiva y un elevado Machine Intelligence Quotient (MIQ). La tesis propuesta intenta explorar el uso de las diferentes aproximaciones en la informática blanda para manipular la incertidumbre en la gestión del riesgo medioambiental. El trabajo se ha dividido en tres secciones que forman parte de cinco artículos. En la primera parte de esta tesis, se han investigado diferentes métodos de propagación de la incertidumbre. El primer método es el generalizado fuzzy α-cut, el cual está basada en el método de transformación. Para demostrar la utilidad de esta aproximación, se ha utilizado un caso de estudio de análisis de incertidumbre en el transporte de la contaminación en suelo. Esta aproximación muestra una superioridad frente a los métodos convencionales de modelación de la incertidumbre. La segunda metodología propuesta trabaja conjuntamente la variabilidad y la incertidumbre en los modelos de evaluación de riesgo. Para ello, se ha elaborado una nueva aproximación híbrida denominada Fuzzy Latin Hypercube Sampling (FLHS), que combina los conjuntos de la teoría de probabilidad con la teoría de los conjuntos difusos. Una propiedad importante de esta teoría es su capacidad para separarse los aleatoriedad y imprecisión, lo que supone la obtención de una mayor calidad de la información. El resumen estadístico fuzzificado de los resultados del modelo generan índices de sensitividad e incertidumbre que relacionan los efectos de la variabilidad e incertidumbre de los parámetros de modelo con las predicciones de los modelos. La viabilidad del método se llevó a cabo mediante la aplicación de un caso a estudio donde se analizó la varianza total en la cálculo del incremento del riesgo sobre el tiempo de vida de los habitantes que habitan en los alrededores de una incineradora de residuos sólidos urbanos en Tarragona, España, debido a las emisiones de dioxinas y furanos (PCDD/Fs). La segunda parte de la tesis consistió en la utilización de las técnicas de la inteligencia artificial para la generación de índices medioambientales. En el primer artículo se desarrolló un Índice de Peligrosidad a partir de los valores de persistencia, bioacumulación y toxicidad de un elevado número de contaminantes orgánicos e inorgánicos. Para su elaboración, se utilizaron los Mapas de Auto-Organizativos (SOM), que proporcionaron un ranking de peligrosidad para cada compuesto. A continuación, se elaboró un Índice de Riesgo Integral teniendo en cuenta el Índice de peligrosidad y las concentraciones de cada uno de los contaminantes en las muestras de suelo recogidas en la zona de estudio. Finalmente, se elaboró un mapa de la distribución espacial del Índice de Riesgo Integral mediante la representación en un Sistema de Información Geográfico (SIG). El segundo artículo es un mejoramiento del primer trabajo. En este estudio, se creó un método híbrido de los Mapas Auto-organizativos con los métodos probabilísticos, obteniéndose de esta forma un Índice de Riesgo Integrado. Mediante la combinación de SOM y el análisis de Monte-Carlo se desarrolló una nueva aproximación llamada Índice de Peligrosidad Neuro-Probabilística. Este nuevo índice es una herramienta adecuada para ser utilizada en los procesos de análisis. En ambos artículos, la viabilidad de los métodos han sido validados a través de su aplicación en el área de la industria química y petroquímica de Tarragona (Cataluña, España). El tercer apartado de esta tesis está enfocado en la elaboración de una estructura metodológica de un sistema de ayuda en la toma de decisiones para la gestión del riesgo medioambiental. En este estudio, se presenta un modelo integrado de análisis de fuzzy (IFRA) para la evaluación del riesgo cuyo resultado depende de múltiples criterios. El modelo es una visión integrada de las técnicas de incertidumbre basadas en diseños de valoraciones múltiples, relaciones fuzzy y procesos analíticos jerárquicos inciertos. La integración de la simulación del sistema y el análisis del riesgo utilizando aproximaciones inciertas permitieron incorporar la incertidumbre procedente del modelo junto con la incertidumbre procedente de la subjetividad de los criterios. En este estudio, se ha demostrado que es posible crear una amplia integración entre la simulación de un sistema incierto y de un análisis de riesgo incierto. En conclusión, este trabajo demuestra ampliamente la utilidad de aproximación Soft Computing en el análisis de riesgos ambientales. Los métodos propuestos podría avanzar significativamente la práctica de análisis de riesgos de abordar eficazmente el problema de propagación de incertidumbre

    Integrating in silico models and read-across methods for predicting toxicity of chemicals: A step-wise strategy

    Get PDF
    Abstract In silico methods and models are increasingly used for predicting properties of chemicals for hazard identification and hazard characterisation in the absence of experimental toxicity data. Many in silico models are available and can be used individually or in an integrated fashion. Whilst such models offer major benefits to toxicologists, risk assessors and the global scientific community, the lack of a consistent framework for the integration of in silico results can lead to uncertainty and even contradictions across models and users, even for the same chemicals. In this context, a range of methods for integrating in silico results have been proposed on a statistical or case-specific basis. Read-across constitutes another strategy for deriving reference points or points of departure for hazard characterisation of untested chemicals, from the available experimental data for structurally-similar compounds, mostly using expert judgment. Recently a number of software systems have been developed to support experts in this task providing a formalised and structured procedure. Such a procedure could also facilitate further integration of the results generated from in silico models and read-across. This article discusses a framework on weight of evidence published by EFSA to identify the stepwise approach for systematic integration of results or values obtained from these "non-testing methods". Key criteria and best practices for selecting and evaluating individual in silico models are also described, together with the means to combining the results, taking into account any limitations, and identifying strategies that are likely to provide consistent results

    Developing magnetic functionalized multi-walled carbon nanotubes-based buckypaper for the removal of Furazolid

    Get PDF
    Magnetic f-MWCNTs-based BP/PVA membrane was fabricated and utilized for the elimination of furazolidone (FZD) from aqueous solution. Characterisation and adsorption studies were performed to evaluate the performance and adsorptive efficiency, respectively of the membrane. Furthermore, statistical and machine learning technique were also applied to predict the removal efficiency of FZD on the membrane. The results revealed that magnetic f-MWCNTs-based BP/PVA membrane has the potential to be used as an efficient membrane for practical applications

    Prediction of bioavailability and toxicity of complex chemical mixtures through machine learning models

    Get PDF
    Empirical data from a 6-month mesocosms experiment were used to assess the ability and performance of two machine learning (ML) models, including artificial neural network (NN) and random forest (RF), to predict temporal bioavailability changes of complex chemical mixtures in contaminated soils amended with compost or biochar. From the predicted bioavailability data, toxicity response for relevant ecological receptors was then forecasted to establish environmental risk implications and determine acceptable end-point remediation. The dataset corresponds to replicate samples collected over 180 days and analysed for total and bioavailable petroleum hydrocarbons and heavy metals/metalloids content. Further to this, a range of biological indicators including bacteria count, soil respiration, microbial community fingerprint, seeds germination, earthworm's lethality, and bioluminescent bacteria were evaluated to inform the environmental risk assessment. Parameters such as soil type, amendment (biochar and compost), initial concentration of individual compounds, and incubation time were used as inputs of the ML models. The relative importance of the input variables was also analysed to better understand the drivers of temporal changes in bioavailability and toxicity. It showed that toxicity changes can be driven by multiple factors (combined effects), which may not be accounted for in classical linear regression analysis (correlation). The use of ML models could improve our understanding of rate-limiting processes affecting the freely available fraction (bioavailable) of contaminants in soil, therefore contributing to mitigate potential risks and to inform appropriate response and recovery methods

    Machine learning techniques implementation in power optimization, data processing, and bio-medical applications

    Get PDF
    The rapid progress and development in machine-learning algorithms becomes a key factor in determining the future of humanity. These algorithms and techniques were utilized to solve a wide spectrum of problems extended from data mining and knowledge discovery to unsupervised learning and optimization. This dissertation consists of two study areas. The first area investigates the use of reinforcement learning and adaptive critic design algorithms in the field of power grid control. The second area in this dissertation, consisting of three papers, focuses on developing and applying clustering algorithms on biomedical data. The first paper presents a novel modelling approach for demand side management of electric water heaters using Q-learning and action-dependent heuristic dynamic programming. The implemented approaches provide an efficient load management mechanism that reduces the overall power cost and smooths grid load profile. The second paper implements an ensemble statistical and subspace-clustering model for analyzing the heterogeneous data of the autism spectrum disorder. The paper implements a novel k-dimensional algorithm that shows efficiency in handling heterogeneous dataset. The third paper provides a unified learning model for clustering neuroimaging data to identify the potential risk factors for suboptimal brain aging. In the last paper, clustering and clustering validation indices are utilized to identify the groups of compounds that are responsible for plant uptake and contaminant transportation from roots to plants edible parts --Abstract, page iv

    Interpretable molecular encodings and representations for machine learning tasks

    Get PDF
    Molecular encodings and their usage in machine learning models have demonstrated significant breakthroughs in biomedical applications, particularly in the classification of peptides and proteins. To this end, we propose a new encoding method: Interpretable Carbon-based Array of Neighborhoods (iCAN). Designed to address machine learning models' need for more structured and less flexible input, it captures the neighborhoods of carbon atoms in a counting array and improves the utility of the resulting encodings for machine learning models. The iCAN method provides interpretable molecular encodings and representations, enabling the comparison of molecular neighborhoods, identification of repeating patterns, and visualization of relevance heat maps for a given data set. When reproducing a large biomedical peptide classification study, it outperforms its predecessor encoding. When extended to proteins, it outperforms a lead structure-based encoding on 71% of the data sets. Our method offers interpretable encodings that can be applied to all organic molecules, including exotic amino acids, cyclic peptides, and larger proteins, making it highly versatile across various domains and data sets. This work establishes a promising new direction for machine learning in peptide and protein classification in biomedicine and healthcare, potentially accelerating advances in drug discovery and disease diagnosis
    corecore