560 research outputs found

    Uneven batch data alignment with application to the control of batch end-product quality

    Get PDF
    keywords: Variable batch lengths keywords: Variable batch lengths keywords: Variable batch lengths keywords: Variable batch lengths keywords: Variable batch length

    ADVANCES ON BILINEAR MODELING OF BIOCHEMICAL BATCH PROCESSES

    Full text link
    [EN] This thesis is aimed to study the implications of the statistical modeling approaches proposed for the bilinear modeling of batch processes, develop new techniques to overcome some of the problems that have not been yet solved and apply them to data of biochemical processes. The study, discussion and development of the new methods revolve around the four steps of the modeling cycle, from the alignment, preprocessing and calibration of batch data to the monitoring of batches trajectories. Special attention is given to the problem of the batch synchronization, and its effect on the modeling from different angles. The manuscript has been divided into four blocks. First, a state-of- the-art of the latent structures based-models in continuous and batch processes and traditional univariate and multivariate statistical process control systems is carried out. The second block of the thesis is devoted to the preprocessing of batch data, in particular, to the equalization and synchronization of batch trajectories. The first section addresses the problem of the lack of equalization in the variable trajectories. The different types of unequalization scenarios that practitioners might finnd in batch processes are discussed and the solutions to equalize batch data are introduced. In the second section, a theoretical study of the nature of batch processes and of the synchronization of batch trajectories as a prior step to bilinear modeling is carried out. The topics under discussion are i) whether the same synchronization approach must be applied to batch data in presence of different types of asynchronisms, and ii) whether synchronization is always required even though the length of the variable trajectories are constant across batches. To answer these questions, a thorough study of the most common types of asynchronisms that may be found in batch data is done. Furthermore, two new synchronization techniques are proposed to solve the current problems in post-batch and real-time synchronization. To improve fault detection and classification, new unsupervised control charts and supervised fault classifiers based on the information generated by the batch synchronization are also proposed. In the third block of the manuscript, a research work is performed on the parameter stability associated with the most used synchronization methods and principal component analysis (PCA)-based Batch Multivariate Statistical Process Control methods. The results of this study have revealed that accuracy in batch synchronization has a profound impact on the PCA model parameters stability. Also, the parameter stability is closely related to the type of preprocessing performed in batch data, and the type of model and unfolding used to transform the three-way data structure to two-way. The setting of the parameter stability, the source of variability remaining after preprocessing and the process dynamics should be balanced in such a way that multivariate statistical models are accurate in fault detection and diagnosis and/or in online prediction. Finally, the fourth block introduces a graphical user-friendly interface developed in Matlab code for batch process understanding and monitoring. To perform multivariate analysis, the last developments in process chemometrics, including the methods proposed in this thesis, are implemented.[ES] La presente tesis doctoral tiene como objetivo estudiar las implicaciones de los métodos estadísticos propuestos para la modelización bilineal de procesos por lotes, el desarrollo de nuevas técnicas para solucionar algunos de los problemas más complejos aún por resolver en esta línea de investigación y aplicar los nuevos métodos a datos provenientes de procesos bioquímicos para su evaluación estadística. El estudio, la discusión y el desarrollo de los nuevos métodos giran en torno a las cuatro fases del ciclo de modelización: desde la sincronización, ecualización, preprocesamiento y calibración de los datos, a la monitorización de las trayectorias de las variables del proceso. Se presta especial atención al problema de la sincronización y su efecto en la modelización estadística desde distintas perspectivas. El manuscrito se ha dividido en cuatro grandes bloques. En primer lugar, se realiza una revisión bibliográfica de las técnicas de proyección sobre estructuras latentes para su aplicación en procesos continuos y por lotes, y del diseño de sistemas de control basados en modelos estadísticos multivariantes. El segundo bloque del documento versa sobre el preprocesamiento de los datos, en concreto, sobre la ecualización y la sincronización. La primera parte aborda el problema de la falta de ecualización en las trayectorias de las variables. Se discuten las diferentes políticas de muestreo que se pueden encontrar en procesos por lotes y las soluciones para ecualizar las variables. En la segunda parte de esta sección, se realiza un estudio teórico sobre la naturaleza de los procesos por lotes y de la sincronización de las trayectorias como paso previo a la modelización bilineal. Los temas bajo discusión son: i) si se debe utilizar el mismo enfoque de sincronización en lotes afectados por diferentes tipos de asincronismos, y ii) si la sincronización es siempre necesaria aún y cuando las trayectorias de las variables tienen la misma duración en todos los lotes. Para responder a estas preguntas, se lleva a cabo un estudio exhaustivo de los tipos más comunes de asincronismos que se pueden encontrar en este tipo de datos. Además, se proponen dos nuevas técnicas de sincronización para resolver los problemas existentes en aplicaciones post-morten y en tiempo real. Para mejorar la detección de fallos y la clasificación, también se proponen nuevos gráficos de control no supervisados y clasificadores de fallos supervisados en base a la información generada por la sincronización de los lotes. En el tercer bloque del manuscrito se realiza un estudio de la estabilidad de los parámetros asociados a los métodos de sincronización y a los métodos estadístico multivariante basados en el Análisis de Componentes Principales (PCA) más utilizados para el control de procesos. Los resultados de este estudio revelan que la precisión de la sincronización de las trayectorias tiene un impacto significativo en la estabilidad de los parámetros de los modelos PCA. Además, la estabilidad paramétrica está estrechamente relacionada con el tipo de preprocesamiento realizado en los datos de los lotes, el tipo de modelo a justado y el despliegue utilizado para transformar la estructura de datos de tres a dos dimensiones. El ajuste de la estabilidad de los parámetros, la fuente de variabilidad que queda después del preprocesamiento de los datos y la captura de las dinámicas del proceso deben ser a justados de forma equilibrada de tal manera que los modelos estadísticos multivariantes sean precisos en la detección y diagnóstico de fallos y/o en la predicción en tiempo real. Por último, el cuarto bloque del documento describe una interfaz gráfica de usuario que se ha desarrollado en código Matlab para la comprensión y la supervisión de procesos por lotes. Para llevar a cabo los análisis multivariantes, se han implementado los últimos desarrollos en la quimiometría de proc[CA] Aquesta tesi doctoral te com a objectiu estudiar les implicacions dels mètodes de modelització estadística proposats per a la modelització bilineal de processos per lots, el desenvolupament de noves tècniques per resoldre els problemes encara no resolts en aquesta línia de recerca i aplicar els nous mètodes a les dades dels processos bioquímics. L'estudi, la discussió i el desenvolupament dels nous mètodes giren entorn a les quatre fases del cicle de modelització, des de l'alineació, preprocessament i el calibratge de les dades provinents de lots, a la monitorització de les trajectòries. Es presta especial atenció al problema de la sincronització per lots, i el seu efecte sobre el modelatge des de diferents angles. El manuscrit s'ha dividit en quatre grans blocs. En primer lloc, es realitza una revisió bibliogràfica dels principals mètodes basats en tècniques de projecció sobre estructures latents en processos continus i per lots, així com dels sistemes de control estadístics multivariats. El segon bloc del document es dedica a la preprocessament de les dades provinents de lots, en particular, l' equalització i la sincronització. La primera part aborda el problema de la manca d'equalització en les trajectòries de les variables. Es discuteixen els diferents tipus d'escenaris en que les variables estan mesurades a distints intervals i les solucions per equalitzar-les en processos per lots. A la segona part d'aquesta secció es porta a terme un estudi teòric de la naturalesa dels processos per lots i de la sincronització de les trajectòries de lots com a pas previ al modelatge bilineal. Els temes en discussió són: i) si el mateix enfocament de sincronització ha de ser aplicat a les dades del lot en presència de diferents tipus de asincronismes, i ii) si la sincronització sempre es requereix tot i que la longitud de les trajectòries de les variables són constants en tots el lots. Per respondre a aquestes preguntes, es du a terme un estudi exhaustiu dels tipus més comuns de asincronismes que es poden trobar en les dades provinents de lots. A més, es proposen dues noves tècniques de sincronització per resoldre els problemes existents la sincronització post-morten i en temps real. Per millorar la detecció i la classificació de anomalies, també es proposen nous gràfics de control no supervisats i classificadors de falla supervisats dissenyats en base a la informació generada per la sincronització de lots. En el tercer bloc del manuscrit es realitza un treball de recerca sobre l'estabilitat dels paràmetres associats als mètodes de sincronització i als mètodes estadístics multivariats basats en l'Anàlisi de Components Principals (PCA) més utilitzats per al control de processos. Els resultats d'aquest estudi revelen que la precisió en la sincronització per lots te un profund impacte en l'estabilitat dels paràmetres dels models PCA. A més, l'estabilitat paramètrica està estretament relacionat amb el tipus de preprocessament realitzat en les dades provinents de lots, el tipus de model i el desplegament utilitzat per transformar l'estructura de dades de tres a dos dimensions. L'ajust de l'estabilitat dels paràmetres, la font de variabilitat que queda després del preprocessament i la captura de la dinàmica de procés ha de ser equilibrada de tal manera que els models estadístics multivariats són precisos en la detecció i diagnòstic de fallades i/o en la predicció en línia. Finalment, el quart bloc del document introdueix una interfície gràfica d'usuari que s'ha dissenyat e implementat en Matlab per a la comprensió i la supervisió de processos per lots. Per dur a terme aquestes anàlisis multivariats, s'han implementat els últims desenvolupaments en la quimiometria de processos, incloent-hi els mètodes proposats en aquesta tesi.González Martínez, JM. (2015). ADVANCES ON BILINEAR MODELING OF BIOCHEMICAL BATCH PROCESSES [Tesis doctoral]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/55684TESISPremios Extraordinarios de tesis doctorale

    Multivariate statistical data analysis of cell-free protein synthesis toward monitoring and control

    Get PDF
    The optimization and control of cell free protein synthesis (CFPS) presents an ongoing challenge due to the complex synergies and nonlinearities that cannot be fully explained in first principle models. This article explores the use of multivariate statistical tools for analyzing data sets collected from the CFPS of Cereulide monoclonal antibodies. During the collection of these data sets, several of the process parameters were modified to investigate their effect on the end‐point product (yield). Through the application of principal component analysis and partial least squares (PLS), important correlations in the process could be identified. For example, yield had a positive correlation with pH and NH3 and a negative correlation with CO2 and dissolved oxygen. It was also found that PLS was able to provide a long‐term prediction of product yield. The presented work illustrates that multivariate statistical techniques provide important insights that can help support the operation and control of CFPS processes

    A Review of Kernel Methods for Feature Extraction in Nonlinear Process Monitoring

    Get PDF
    Kernel methods are a class of learning machines for the fast recognition of nonlinear patterns in any data set. In this paper, the applications of kernel methods for feature extraction in industrial process monitoring are systematically reviewed. First, we describe the reasons for using kernel methods and contextualize them among other machine learning tools. Second, by reviewing a total of 230 papers, this work has identified 12 major issues surrounding the use of kernel methods for nonlinear feature extraction. Each issue was discussed as to why they are important and how they were addressed through the years by many researchers. We also present a breakdown of the commonly used kernel functions, parameter selection routes, and case studies. Lastly, this review provides an outlook into the future of kernel-based process monitoring, which can hopefully instigate more advanced yet practical solutions in the process industries

    An Integrated Approach to Performance Monitoring and Fault Diagnosis of Nuclear Power Systems

    Get PDF
    In this dissertation an integrated framework of process performance monitoring and fault diagnosis was developed for nuclear power systems using robust data driven model based methods, which comprises thermal hydraulic simulation, data driven modeling, identification of model uncertainty, and robust residual generator design for fault detection and isolation. In the applications to nuclear power systems, on the one hand, historical data are often not able to characterize the relationships among process variables because operating setpoints may change and thermal fluid components such as steam generators and heat exchangers may experience degradation. On the other hand, first-principle models always have uncertainty and are often too complicated in terms of model structure to design residual generators for fault diagnosis. Therefore, a realistic fault diagnosis method needs to combine the strength of first principle models in modeling a wide range of anticipated operation conditions and the strength of data driven modeling in feature extraction. In the developed robust data driven model-based approach, the changes in operation conditions are simulated using the first principle models and the model uncertainty is extracted from plant operation data such that the fault effects on process variables can be decoupled from model uncertainty and normal operation changes. It was found that the developed robust fault diagnosis method was able to eliminate false alarms due to model uncertainty and deal with changes in operating conditions throughout the lifetime of nuclear power systems. Multiple methods of robust data driven model based fault diagnosis were developed in this dissertation. A complete procedure based on causal graph theory and data reconciliation method was developed to investigate the causal relationships and the quantitative sensitivities among variables so that sensor placement could be optimized for fault diagnosis in the design phase. Reconstruction based Principal Component Analysis (PCA) approach was applied to deal with both simple faults and complex faults for steady state diagnosis in the context of operation scheduling and maintenance management. A robust PCA model-based method was developed to distinguish the differences between fault effects and model uncertainties. In order to improve the sensitivity of fault detection, a hybrid PCA model based approach was developed to incorporate system knowledge into data driven modeling. Subspace identification was proposed to extract state space models from thermal hydraulic simulations and a robust dynamic residual generator design algorithm was developed for fault diagnosis for the purpose of fault tolerant control and extension to reactor startup and load following operation conditions. The developed robust dynamic residual generator design algorithm is unique in that explicit identification of model uncertainty is not necessary. Finally, it was demonstrated that the developed new methods for the IRIS Helical Coil Steam Generator (HCSG) system. A simulation model was first developed for this system. It was revealed through steady state simulation that the primary coolant temperature profile could be used to indicate the water inventory inside the HCSG tubes. The performance monitoring and fault diagnosis module was then developed to monitor sensor faults, flow distribution abnormality, and heat performance degradation for both steady state and dynamic operation conditions. This dissertation bridges the gap between the theoretical research on computational intelligence and the engineering design in performance monitoring and fault diagnosis for nuclear power systems. The new algorithms have the potential of being integrated into the Generation III and Generation IV nuclear reactor I&C design after they are tested on current nuclear power plants or Generation IV prototype reactors

    Machine Learning Based Approaches to Estimation of Quality Variables in Batch Processes

    Get PDF
    학위논문(석사) -- 서울대학교대학원 : 공과대학 화학생물공학부, 2023. 2. 이종민.안전하고 경제적인 배치 공정의 운전 방향을 제시하기 위해서 제품 품질 예측 모델의 개발이 필요하다. 하지만 배치 공정은 비정상상태에서 운전되어 강한 비선형성을 보이기 때문에 물리 법칙 기반 모델의 구축이 어렵다. 또한 각 배치 별로 서로 다른 운전 시간을 가지고 운전 조건에 따라 여러 개의 phase로 구성되는 배치 공정의 특성은 품질 예측 모델의 개발을 더욱 어렵게 만든다. 따라서 본 연구에서는 서로 다른 운전 시간을 갖는 배치 공정에 대해서 공정 운전의 phase를 고려한 데이터 기반 품질 예측 모델을 개발하였다. 공정 데이터의 시계열성을 고려한 phase 분할을 위해서 warped K-means clustering (WKM) 알고리즘을 적용하여 phase를 분할하였다. 분리된 각각의 phase에 대하여 별도의 Recurrent neural network (RNN) 셀을 학습시키는 형태의 품질 예측 모델을 개발하였다. 개발된 품질 예측 모델은 기존의 대표적인 방법론인 Dynamic time warping (DTW)과 Multiway partial least squares (MPLS)를 사용한 경우보다 좋은 성능을 보였다. 또한 phase를 구분하지 않고 하나의 RNN 셀을 사용한 경우보다 phase 별로 서로 다른 RNN 셀을 사용하는 경우에 더 좋은 성능을 보이는 것을 확인하였다. 추가적으로 해당 품질 예측 모델을 온라인으로 활용 가능한 형태로 발전시킴으로써 공정 운전 중 발생하는 이상 진단 및 최적 운전 조건의 탐색에 활용할 수 있을 것이다.It is necessary to develop a product quality estimation model to operate batch processes in a safe and economical way. However, since the batch processes are operated in an unsteady state and show strong nonlinearity, it is difficult to build a first principle model. In addition, the batch processes show uneven operating duration for each batch and consist of several phases according to operating conditions, which makes the development of a quality estimation model more difficult. In this study, a machine learning based approach to estimation of quality variables considering the uneven duration and the multi-phase of batch processes was developed. The phases were divided by applying the warped K-means clustering (WKM) algorithm considering the sequentiality of process data. The quality estimation model in the form of a separate Recurrent Neural Network (RNN) cell for each phase was used. The developed quality estimation model showed better performance than the model using dynamic time warping (DTW) and multiway partial least squares (MPLS). In addition, using different RNN cells for each phase shows better performance than using one RNN cell without distinguishing the phases. By developing the quality estimation model into a form that can be used online, it will be possible to use the developed model for diagnosing abnormalities and searching for optimal operating conditions.제 1 장 서론 5 제 2 장 방법론 12 제 1 절 배치 공정 데이터의 전처리 12 제 2 절 Dynamic Time Warping 18 제 3 절 Multiway Partial Least Squares 20 제 4 절 Phase Partition 21 제 1 항 Sub-PCA 알고리즘 21 제 2 항 Warped K-Means Clustering 알고리즘 23 제 5 절 Recurrent Neural Network 26 제 1 항 Vanilla RNN, LSTM, GRU 26 제 2 항 Multi RNN 모델 31 제 3 장 페니실린 생산 공정 33 제 4 장 결과 및 분석 38 제 1 절 DTW-MPLS 모델 39 제 2 절 Single RNN 모델 45 제 3 절 Multi RNN 모델 52 제 1 항 Phase Partition 52 제 2 항 Multi RNN 모델 59 제 5 장 결론 70석

    Process Monitoring and Data Mining with Chemical Process Historical Databases

    Get PDF
    Modern chemical plants have distributed control systems (DCS) that handle normal operations and quality control. However, the DCS cannot compensate for fault events such as fouling or equipment failures. When faults occur, human operators must rapidly assess the situation, determine causes, and take corrective action, a challenging task further complicated by the sheer number of sensors. This information overload as well as measurement noise can hide information critical to diagnosing and fixing faults. Process monitoring algorithms can highlight key trends in data and detect faults faster, reducing or even preventing the damage that faults can cause. This research improves tools for process monitoring on different chemical processes. Previously successful monitoring methods based on statistics can fail on non-linear processes and processes with multiple operating states. To address these challenges, we develop a process monitoring technique based on multiple self-organizing maps (MSOM) and apply it in industrial case studies including a simulated plant and a batch reactor. We also use standard SOM to detect a novel event in a separation tower and produce contribution plots which help isolate the causes of the event. Another key challenge to any engineer designing a process monitoring system is that implementing most algorithms requires data organized into “normal” and “faulty”; however, data from faulty operations can be difficult to locate in databases storing months or years of operations. To assist in identifying faulty data, we apply data mining algorithms from computer science and compare how they cluster chemical process data from normal and faulty conditions. We identify several techniques which successfully duplicated normal and faulty labels from expert knowledge and introduce a process data mining software tool to make analysis simpler for practitioners. The research in this dissertation enhances chemical process monitoring tasks. MSOM-based process monitoring improves upon standard process monitoring algorithms in fault identification and diagnosis tasks. The data mining research reduces a crucial barrier to the implementation of monitoring algorithms. The enhanced monitoring introduced can help engineers develop effective and scalable process monitoring systems to improve plant safety and reduce losses from fault events

    Multi-synchro: a novel approach for batch synchronization in scenarios of multiple asynchronisms

    Full text link
    Batch synchronization has been widely misunderstood as being only needed when variable trajectories have uneven length. Batch data are actually considered not synchronized when the key process events do not occur at the same point of process evolution, irrespective of whether the batch duration is the same for all batches or not. Additionally, a single synchronization procedure is usually applied to all batches without taking into account the nature of asynchronism of each batch, and the presence of abnormalities. This strategy may distort the original trajectories and decrease the signal-to-noise ratio, affecting the subsequent multivariate analyses. The approach proposed in this paper, named multisynchro, overcomes these pitfalls in scenarios of multiple asynchronisms. The different types of asynchronisms are effectively detected by using the warping information derived from synchronization. Each set of batch trajectories is synchronized by appropriate synchronization procedures, which are automatically selected based on the nature of asynchronisms present in data. The novel approach also includes a procedure that performs abnormality detection and batch synchronization in an iterative manner. Data from realistic simulations of a fermentation process of the Saccharomyces cerevisiae cultivation are used to illustrate the performance of the proposed approach in a context of multiple asynchronisms.This research work was partially supported by the Spanish Ministry of Economy and Competitiveness under the project DPI2011-28112-C04-02. Part of this research work was carried out during an internship of the corresponding author at Shell Global Solutions International B.V. (Amsterdam, The Netherlands). The authors also thank the anonymous referees for their comments, which greatly helped to improve the text.González Martínez, JM.; De Noord, O.; Ferrer, A. (2014). Multi-synchro: a novel approach for batch synchronization in scenarios of multiple asynchronisms. Journal of Chemometrics. 28(5):462-475. https://doi.org/10.1002/cem.2620S462475285Kourti, T. (2009). Multivariate Statistical Process Control and Process Control, Using Latent Variables. Comprehensive Chemometrics, 21-54. doi:10.1016/b978-044452701-1.00013-2Wold, S., Kettaneh-Wold, N., MacGregor, J. F., & Dunn, K. G. (2009). Batch Process Modeling and MSPC. Comprehensive Chemometrics, 163-197. doi:10.1016/b978-044452701-1.00108-3Kourti, T. (2003). Abnormal situation detection, three-way data and projection methods; robust data archiving and modeling for industrial applications. Annual Reviews in Control, 27(2), 131-139. doi:10.1016/j.arcontrol.2003.10.004Lakshminarayanan S Gudi R Shah S Monitoring batch processes using multivariate statistical tools: extensions and practical issues. 1996 241 246Zarzo, M., & Ferrer, A. (2004). Batch process diagnosis: PLS with variable selection versus block-wise PCR. Chemometrics and Intelligent Laboratory Systems, 73(1), 15-27. doi:10.1016/j.chemolab.2003.11.009Louwerse, D. J., & Smilde, A. K. (2000). Multivariate statistical process control of batch processes based on three-way models. Chemical Engineering Science, 55(7), 1225-1235. doi:10.1016/s0009-2509(99)00408-xWesterhuis, J. A., Kourti, T., & MacGregor, J. F. (1999). Comparing alternative approaches for multivariate statistical analysis of batch process data. Journal of Chemometrics, 13(3-4), 397-413. doi:10.1002/(sici)1099-128x(199905/08)13:3/43.0.co;2-iNomikos, P., & MacGregor, J. F. (1994). Monitoring batch processes using multiway principal component analysis. AIChE Journal, 40(8), 1361-1375. doi:10.1002/aic.690400809Ündey, C., Ertunç, S., & Çınar, A. (2003). Online Batch/Fed-Batch Process Performance Monitoring, Quality Prediction, and Variable-Contribution Analysis for Diagnosis. Industrial & Engineering Chemistry Research, 42(20), 4645-4658. doi:10.1021/ie0208218Neogi, D., & Schlags, C. E. (1998). Multivariate Statistical Analysis of an Emulsion Batch Process. Industrial & Engineering Chemistry Research, 37(10), 3971-3979. doi:10.1021/ie980243oKourti, T., Lee, J., & Macgregor, J. F. (1996). Experiences with industrial applications of projection methods for multivariate statistical process control. Computers & Chemical Engineering, 20, S745-S750. doi:10.1016/0098-1354(96)00132-9Duchesne, C., Kourti, T., & MacGregor, J. F. (2002). Multivariate SPC for startups and grade transitions. AIChE Journal, 48(12), 2890-2901. doi:10.1002/aic.690481216Zhang, Y., Dudzic, M., & Vaculik, V. (2003). Integrated monitoring solution to start-up and run-time operations for continuous casting. Annual Reviews in Control, 27(2), 141-149. doi:10.1016/j.arcontrol.2003.10.002Rothwell SG Martin EB Morris AJ Comparison of methods for handling unequal length batches 1998 66 71García-Muñoz, S., Kourti, T., MacGregor, J. F., Mateos, A. G., & Murphy, G. (2003). Troubleshooting of an Industrial Batch Process Using Multivariate Methods. Industrial & Engineering Chemistry Research, 42(15), 3592-3601. doi:10.1021/ie0300023Wold, S., Kettaneh, N., Fridén, H., & Holmberg, A. (1998). Modelling and diagnostics of batch processes and analogous kinetic experiments. Chemometrics and Intelligent Laboratory Systems, 44(1-2), 331-340. doi:10.1016/s0169-7439(98)00162-2Kaistha, N., & Moore, C. F. (2001). Extraction of Event Times in Batch Profiles for Time Synchronization and Quality Predictions. Industrial & Engineering Chemistry Research, 40(1), 252-260. doi:10.1021/ie990937cRamsay, J. O., & Silverman, B. W. (1997). Functional Data Analysis. Springer Series in Statistics. doi:10.1007/978-1-4757-7107-7Statistical monitoring of multistage, multiphase batch processes. (2002). IEEE Control Systems, 22(5), 40-52. doi:10.1109/mcs.2002.1035216Andersen, S. W., & Runger, G. C. (2012). Automated feature extraction from profiles with application to a batch fermentation process. Journal of the Royal Statistical Society: Series C (Applied Statistics), 61(2), 327-344. doi:10.1111/j.1467-9876.2011.01032.xSrinivasan, R., & Qian, M. S. (2005). Off-line Temporal Signal Comparison Using Singular Points Augmented Time Warping. Industrial & Engineering Chemistry Research, 44(13), 4697-4716. doi:10.1021/ie049528tSrinivasan, R., & Sheng Qian, M. (2006). Online fault diagnosis and state identification during process transitions using dynamic locus analysis. Chemical Engineering Science, 61(18), 6109-6132. doi:10.1016/j.ces.2006.05.037Srinivasan, R., & Qian, M. (2007). Online Temporal Signal Comparison Using Singular Points Augmented Time Warping. Industrial & Engineering Chemistry Research, 46(13), 4531-4548. doi:10.1021/ie060111sChen, J., & Liu, J. (2000). Post analysis on different operating time processes using orthonormal function approximation and multiway principal component analysis. Journal of Process Control, 10(5), 411-418. doi:10.1016/s0959-1524(00)00016-0Sakoe, H., & Chiba, S. (1978). Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(1), 43-49. doi:10.1109/tassp.1978.1163055Nielsen, N.-P. V., Carstensen, J. M., & Smedsgaard, J. (1998). Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping. Journal of Chromatography A, 805(1-2), 17-35. doi:10.1016/s0021-9673(98)00021-1Pravdova, V., Walczak, B., & Massart, D. L. (2002). A comparison of two algorithms for warping of analytical signals. Analytica Chimica Acta, 456(1), 77-92. doi:10.1016/s0003-2670(02)00008-9Tomasi, G., van den Berg, F., & Andersson, C. (2004). Correlation optimized warping and dynamic time warping as preprocessing methods for chromatographic data. Journal of Chemometrics, 18(5), 231-241. doi:10.1002/cem.859Kassidas, A., MacGregor, J. F., & Taylor, P. A. (1998). Synchronization of batch trajectories using dynamic time warping. AIChE Journal, 44(4), 864-875. doi:10.1002/aic.690440412Gollmer, K., & Posten, C. (1996). Supervision of bioprocesses using a dynamic time warping algorithm. Control Engineering Practice, 4(9), 1287-1295. doi:10.1016/0967-0661(96)00136-0Ramaker, H.-J., van Sprang, E. N. M., Westerhuis, J. A., & Smilde, A. K. (2003). Dynamic time warping of spectroscopic BATCH data. Analytica Chimica Acta, 498(1-2), 133-153. doi:10.1016/j.aca.2003.08.045Fransson, M., & Folestad, S. (2006). Real-time alignment of batch process data using COW for on-line process monitoring. Chemometrics and Intelligent Laboratory Systems, 84(1-2), 56-61. doi:10.1016/j.chemolab.2006.04.020González-Martínez, J. M., Ferrer, A., & Westerhuis, J. A. (2011). Real-time synchronization of batch trajectories for on-line multivariate statistical process control using Dynamic Time Warping. Chemometrics and Intelligent Laboratory Systems, 105(2), 195-206. doi:10.1016/j.chemolab.2011.01.003Gins, G., Van den Kerkhof, P., & Van Impe, J. F. M. (2012). Hybrid Derivative Dynamic Time Warping for Online Industrial Batch-End Quality Estimation. Industrial & Engineering Chemistry Research, 51(17), 6071-6084. doi:10.1021/ie2019068Zhang Y Edgar TF A robust dynamic time warping algorithm for batch trajectory synchronization 2008 2864 2869González-Martínez, J. M., Westerhuis, J. A., & Ferrer, A. (2013). Using warping information for batch process monitoring and fault classification. Chemometrics and Intelligent Laboratory Systems, 127, 210-217. doi:10.1016/j.chemolab.2013.07.003Kourti, T. (2003). Multivariate dynamic data modeling for analysis and statistical process control of batch processes, start-ups and grade transitions. Journal of Chemometrics, 17(1), 93-109. doi:10.1002/cem.778González-Martínez, J. M., Camacho, J., & Ferrer, A. (2013). Bilinear modeling of batch processes. Part III: parameter stability. Journal of Chemometrics, 28(1), 10-27. doi:10.1002/cem.2562Camacho, J., & Ferrer, A. (2014). Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects. Chemometrics and Intelligent Laboratory Systems, 131, 37-50. doi:10.1016/j.chemolab.2013.12.003Arteaga, F., & Ferrer, A. (2002). Dealing with missing data in MSPC: several methods, different interpretations, some examples. Journal of Chemometrics, 16(8-10), 408-418. doi:10.1002/cem.750Lei, F., Rotbøll, M., & Jørgensen, S. B. (2001). A biochemically structured model for Saccharomyces cerevisiae. Journal of Biotechnology, 88(3), 205-221. doi:10.1016/s0168-1656(01)00269-3Camacho J González-Martínez JM Ferrer A Multi-phase (MP) toolbox 2013 http://mseg.webs.upv.es/Software.htmlUMETRICS SIMCA 13.0.3 Umea, Sweden 2013 [email protected] www.umetrics.comGonzález-Martínez, J. M., Vitale, R., de Noord, O. E., & Ferrer, A. (2014). Effect of Synchronization on Bilinear Batch Process Modeling. Industrial & Engineering Chemistry Research, 53(11), 4339-4351. doi:10.1021/ie402052

    SensorSCAN: Self-Supervised Learning and Deep Clustering for Fault Diagnosis in Chemical Processes

    Full text link
    Modern industrial facilities generate large volumes of raw sensor data during the production process. This data is used to monitor and control the processes and can be analyzed to detect and predict process abnormalities. Typically, the data has to be annotated by experts in order to be used in predictive modeling. However, manual annotation of large amounts of data can be difficult in industrial settings. In this paper, we propose SensorSCAN, a novel method for unsupervised fault detection and diagnosis, designed for industrial chemical process monitoring. We demonstrate our model's performance on two publicly available datasets of the Tennessee Eastman Process with various faults. The results show that our method significantly outperforms existing approaches (+0.2-0.3 TPR for a fixed FPR) and effectively detects most of the process faults without expert annotation. Moreover, we show that the model fine-tuned on a small fraction of labeled data nearly reaches the performance of a SOTA model trained on the full dataset. We also demonstrate that our method is suitable for real-world applications where the number of faults is not known in advance. The code is available at https://github.com/AIRI-Institute/sensorscan

    Multivariate statistical data analysis of cell‐free protein synthesis toward monitoring and control

    Get PDF
    From Wiley via Jisc Publications RouterHistory: received 2020-09-14, rev-recd 2021-01-28, accepted 2021-03-06, pub-electronic 2021-03-23, pub-print 2021-06Article version: VoRPublication status: PublishedFunder: UK Engineering and Physical Sciences Research Council (EPSRC); Id: http://dx.doi.org/10.13039/501100000266; Grant(s): EP/P006485/1Abstract: The optimization and control of cell free protein synthesis (CFPS) presents an ongoing challenge due to the complex synergies and nonlinearities that cannot be fully explained in first principle models. This article explores the use of multivariate statistical tools for analyzing data sets collected from the CFPS of Cereulide monoclonal antibodies. During the collection of these data sets, several of the process parameters were modified to investigate their effect on the end‐point product (yield). Through the application of principal component analysis and partial least squares (PLS), important correlations in the process could be identified. For example, yield had a positive correlation with pH and NH3 and a negative correlation with CO2 and dissolved oxygen. It was also found that PLS was able to provide a long‐term prediction of product yield. The presented work illustrates that multivariate statistical techniques provide important insights that can help support the operation and control of CFPS processes
    corecore