7 research outputs found

    Surveillance de procédés à base de méthodes de classification : conception d'un outil d'aide pour la détection et le diagnostic des défaillances

    No full text
    The present work belongs to the field of decision support systems for complex process monitoring, such as chemical and petrochemical plants. Since it is not always possible to obtain a mathematical model for these processes, it is necessary to consider other approaches such as learning and classification methods, in order to identify their different operating modes (normal or faulty). We propose a strategy based on Data Mining techniques, which allows the construction of a discrete event model of the process behavior using historical and online data. This strategy consists on an offline learning stage for the elaboration of a first reference model. This model, in the form of a finite state automaton, must be validated and completed by the process expert. A second online stage consists in tracking the identified process states. A deviation is detected when a given number of observations are not recognized into any expected functional state. At this stage a new learning procedure is proposed in order to identify the nature of the deviation. The new resulting classes and information concerning the descriptors involved are presented to the expert as support for his diagnosis. A decision support software tool for monitoring processes using LAMDA classification algorithm has been developed based on the proposed strategy. LAMDA method uses Fuzzy logic theory and introduces the adequacy concept for the assignment of an element to a class. Within the context of CHEM European project the principal aspects of our work were tested on different industrial and pilot plants.Les travaux prĂ©sentĂ©s se situent dans le domaine de l'aide Ă  la dĂ©cision pour la surveillance de systĂšmes complexes tels que les procĂ©dĂ©s chimiques. Pour de tels procĂ©dĂ©s il n'est pas toujours possible de disposer d'un modĂšle mathĂ©matique ou structurel du systĂšme considĂ©rĂ©. De ce fait, d'autres types d'approches telles que les mĂ©thodes de classification, doivent ĂȘtre envisagĂ©es pour l'identification des Ă©tats fonctionnels dans lesquels le systĂšme peut se trouver. Sur la base de telles mĂ©thodes notre travail prĂ©sente une stratĂ©gie permettant de construire, Ă  partir de donnĂ©es historiques et de donnĂ©es rĂ©cupĂ©rĂ©es en ligne, un modĂšle discret (Ă©tats/transitions) du comportement du processus et d'identifier des situations anormales issues des dysfonctionnements. Cette stratĂ©gie consiste Ă  gĂ©nĂ©rer un premier modĂšle de rĂ©fĂ©rence, sous la forme d'un automate Ă  Ă©tats finis, du procĂ©dĂ© Ă  partir d'un apprentissage, supervisĂ© ou non. Ce modĂšle est ensuite validĂ© et complĂ©tĂ© par l'expert. La reconnaissance en ligne permet de suivre l'Ă©volution temporelle des modes de fonctionnement dĂ©jĂ  identifiĂ©s. Dans le cas oĂč une transition amĂšne Ă  la non-reconnaissance d'un certain nombre d'Ă©lĂ©ments, c'est-Ă -dire Ă  la dĂ©tection d'une dĂ©viation par rapport Ă  un comportement connu, l'objectif est de caractĂ©riser cette nouvelle situation. Pour cela, nous proposons de faire un nouvel apprentissage hors ligne prenant en compte ces Ă©lĂ©ments non reconnus. Les nouvelles classes crĂ©Ă©es permettent, toujours en interaction avec l'expert, de fixer la nature de la dĂ©viation observĂ©e. Dans le cas d'une dĂ©faillance, une analyse portant sur les descripteurs et le profil des classes permet l'isolation de la dĂ©faillance. Ces informations sont transmises Ă  l'opĂ©rateur pour l'assister dans son diagnostic. Un outil d'aide Ă  la dĂ©cision pour la surveillance s'appuyant sur cette stratĂ©gie a Ă©tĂ© mis en place. Cet outil appelĂ© SALSA repose sur la mĂ©thode LAMDA. Il s'agit d'une mĂ©thode de classification avec apprent issage et reconnaissance de formes qui permet l'analyse de donnĂ©es multi-variables et qui utilise des notions de la logique floue pour introduire le concept d'adĂ©quation d'un Ă©lĂ©ment Ă  une classe. Dans le cadre du projet europĂ©en CHEM les principaux aspects de nos travaux et les rĂ©sultats obtenus ont Ă©tĂ© illustrĂ©s sur des unitĂ©s industrielles de nature diffĂ©rente

    Membership-margin based feature selection for mixed type and high-dimensional data: Theory and applications

    No full text
    International audienceThe present paper describes a new feature weighting method based on a membership margin. Distinctive properties of the proposed method include its capability to process problems characterized by mixed-type data (quantitative, qualitative and interval) as well as a huge number of features. The key idea is to map simultaneously all the features of different types into a common space; the membership space. Once all features are represented in a homogeneous space, a feature weighting task can be performed in unified way. This weighting approach is integrated here within a fuzzy classifier through a fuzzy rule weighted concept in order to improve its performance. Each antecedent fuzzy set in the fuzzy if–then rule is weighted to characterize the importance of each proposition and therefore its corresponding feature. Weight estimation process is based on membership margin maximization to estimate a fuzzy weight of each feature in the membership space. Experiments on low and high dimensional real-world datasets demonstrate that the proposed approach can improve significantly the performance of the fuzzy rule-based as well as other state of the art classifiers and can even outperform classical feature weighting approaches. In particular, we show that this approach can yield meaningful results on two real-world applications for cancer prognosis and industrial process diagnosis

    Situation prediction based on fuzzy clustering for industrial complex processes

    No full text
    International audiencePrediction of process behavior is important and useful to understand the system status and to take early control actions during operation. This paper presents a fuzzy clustering approach for predicting situations (functional states) in complex process industries. The proposed methodology combines a static measurement, such as the result of a fuzzy classifier trained with historical process data, and an estimation algorithm based on Markov‘s theory for discrete event systems. The situation prediction function is integrated into a process monitoring system without increasing the computational cost, which makes real-time implementation feasible. The monitoring strategy includes two principal stages: an offline stage for designing the fuzzy classifier and the predictor, and an online stage for identifying current process situations and for estimating predicted functional states. Thus, at each sample time, the results of a fuzzy classifier are used as inputs in the prediction procedure. An attractive feature of our proposed method, for situation prediction, is that it provides information about the evolution of the process. The proposed approach was tested on a monitoring system for a power transmission line, and also for monitoring a boiler subsystem of a steam generator. Experimental results indicate that our proposed technique in this paper is effective and can be used as a tool, for operators, to be used in industrial process decision making

    Symbolic Data Analysis to Defy Low Signal-to-Noise Ratio in Microarray Data for Breast Cancer Prognosis

    No full text
    19International audienceMicroarray profiling has brought recently the hope to gain new insights into breast cancer biology and thereby improve the performance of current prognostic tools. However, it also poses several serious challenges to classical data analysis techniques related to the characteristics of resulted data, mainly high-dimensionality and low signal-to-noise ratio. Despite the tremendous research work performed to handle the first challenge in the feature selection framework, very little attention has been directed to address the second one. We propose in this paper to address both issues simultaneously based on symbolic data analysis capabilities in order to derive more accurate genetic marker-based prognostic models. In particular, interval data representation is employed to model various uncertainties in microarray measurements. A recent feature selection algorithm that handles symbolic interval data is used then to derive a genetic signature. The predictive value of the derived signature is then assessed by following a rigorous experimental setup and compared to existing prognostic approaches in terms of predictive performance and estimated survival probability. It is shown that the derived signature (GenSym) performs significantly better than other prognostic models, including the 70-gene signature, St. Gallen and NIH criterions

    Fuzzy logic selection as a new reliable tool to identify molecular grade signatures in breast cancer – the INNODIAG study

    Get PDF
    International audienceBackground : Personalized medicine has become a priority in breast cancer patient management. In addition to the routinely used clinicopathological characteristics, clinicians will have to face an increasing amount of data derived from tumor molecular profiling. The aims of this study were to develop a new gene selection method based on a fuzzy logic selection and classification algorithm, and to validate the gene signatures obtained on breast cancer patient cohorts.Methods : We analyzed data from four published gene expression datasets for breast carcinomas. We identified the best discriminating genes by comparing molecular expression profiles between histologic grade 1 and 3 tumors for each of the training datasets. The most pertinent probes were selected and used to define fuzzy molecular grade 1-like (good prognosis) and fuzzy molecular grade 3-like (poor prognosis) profiles. To evaluate the prognostic performance of the fuzzy grade signatures in breast cancer tumors, a Kaplan-Meier analysis was conducted to compare the relapse-free survival deduced from histologic grade and fuzzy molecular grade classification.Results : We applied the fuzzy logic selection on breast cancer databases and obtained four new gene signatures. Analysis in the training public sets showed good performance of these gene signatures for grade (sensitivity from 90% to 95%, specificity 67% to 93%). To validate these gene signatures, we designed probes on custom microarrays and tested them on 150 invasive breast carcinomas. Good performance was obtained with an error rate of less than 10%. For one gene signature, among 74 histologic grade 3 and 18 grade 1 tumors, 88 cases (96%) were correctly assigned. Interestingly histologic grade 2 tumors (n = 58) were split in these two molecular grade categories. Conclusion : We confirmed the use of fuzzy logic selection as a new tool to identify gene signatures with good reliability and increased classification power. This method based on artificial intelligence algorithms was successfully applied to breast cancers molecular grade classification allowing histologic grade 2 classification into grade 1 and grade 2 like to improve patients prognosis. It opens the way to further development for identification of new biomarker combinations in other applications such as prediction of treatment response
    corecore