Skip to main content
Article thumbnail
Location of Repository

Data-driven Soft Sensors in the Process Industry

By Petr Kadlec, Bogdan Gabrys and Sibylle Strandt


In the last two decades Soft Sensors established themselves as a valuable alternative to the traditional means for the acquisition of critical process variables, process monitoring and other tasks which are related to process control. This paper discusses characteristics of the process industry data which are critical for the development of data-driven Soft Sensors. These characteristics are common to a large number of process industry fields, like the chemical industry, bioprocess industry, steel industry, etc. The focus of this work is put on the data-driven Soft Sensors because of their growing popularity, already demonstrated usefulness and huge, though yet not completely realised, potential. A comprehensive selection of case studies covering the three most important Soft Sensor application fields, a general introduction to the most popular Soft Sensor modelling techniques as well as a discussion of some open issues in the Soft Sensor development and maintenance and their possible solutions are the main contributions of this work

Topics: aintel, csi
Year: 2009
OAI identifier:

Suggested articles


  1. (2004a) PCA+ANFIS OP polymeric-coated substrate anchorage Cont.
  2. (2004a) RNN OP biomass concentration prediction Batch Chen et al. (2004b) RNN OP melt-flow-length prediction in injection molding process Cont.
  3. (1997). (2004b) PLS, PCA OP simulated distillation column Batch Dayal and MacGregor
  4. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. doi
  5. (2006). A method for predicting quality of the crude oil distillation. In: Evolving Fuzzy Systems, doi
  6. (2005). A new fault diagnosis method using fault directions in fisher discriminant analysis. doi
  7. (2000). A nonlinear soft sensor based on multivariate smoothing procedure for quality estimation in distillation columns. doi
  8. (2002). A perspective view and survey of meta-learning.
  9. (2003). A review of process fault detection and diagnosis part i: Quantitative model-based methods. doi
  10. (2003). A review of process fault detection and diagnosis part ii: Qualitative models and search strategies. doi
  11. (2003). A review of process fault detection and diagnosis part iii: Process history based methods. doi
  12. (1997). A self-organizing neural-network-based fuzzy system. Artificial Neural Networks, doi
  13. (2000). A self-tuning adaptive control applied to an industrial large scale ethanol production. doi
  14. (2003). A soft sensor modeling approach using support vector machines. American Control Conference, doi
  15. (2004). A soft-sensor development for melt-flow-length measurement during injection mold filling. doi
  16. (1993). A statistical view of some chemometrics regression tools. doi
  17. (1995). A study of cross-validation and bootstrap for accuracy estimation 67and model selection.
  18. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection.
  19. (2004). A survey of outlier detection methodologies. doi
  20. (2007). A systematic approach for soft sensor development. doi
  21. (1998). Adaptive batch monitoring using hierarchical pca. doi
  22. (2008). Adaptive local learning soft sensor for inferential control support. In: CIMCA doi
  23. (2004). Adaptive moving window mpca for online batch monitoring. In: doi
  24. (2006). Adaptive multivariate statistical process control for monitoring time-varying processes. doi
  25. (2004). Ahu sensor fault diagnosis using principal component analysis method. doi
  26. (2001). An adaptive neuro-fuzzy inference system as a soft sensor for viscosity in rubber mixing process.
  27. (1999). An empirical comparison of voting classification algorithms: Bagging, boosting, and variants.
  28. (2001). An introduction to the kalman filter. An introduction to the kalman filter. doi
  29. (2003). An introduction to variable and feature selection. doi
  30. (2000). An overview of classifier fusion methods.
  31. (1995). Analysis, monitoring and fault diagnosis of batch processes using multiblock and multiway pls. doi
  32. (2001). ANFIS OP rubber viscosity estimation Cont.
  33. (2005). Application issues of industrial soft computing systems. Fuzzy Information Processing Society, doi
  34. (2002). Application of feedforward neural networks for soft sensors in the sugar industry. In: Neural Networks, doi
  35. (2003). Application of steady-state detection method based on wavelet transform. doi
  36. (2004). Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems. doi
  37. (1996). Bagging predictors. doi
  38. (2003). Business process management: The third waveBusiness process management: The third wave.
  39. (2005). Classifier selection for majority voting. doi
  40. (2004). Combining Pattern Classifiers: Methods and Algorithms. doi
  41. (2006). Combining process and spectroscopic data to improve batch modeling. doi
  42. (2002). Comparative study of black-box and hybrid estimation methods in fed-batch fermentation. doi
  43. (2005). Competitive advantages of evolutionary computation for industrial applications. In: Evolutionary Computation, doi
  44. (1991). Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems.
  45. (1979). Control procedures for residuals associated with principal component analysis. doi
  46. (2002). Critical evaluation of approaches for on-line batch process monitoring. doi
  47. (2004). Data-driven modeling of batch processes. In: doi
  48. (2001). Dealing with missing data part i. doi
  49. (2001). Dealing with missing data: Part ii. doi
  50. (2002). Dealing with missing data.
  51. (2007). Dealing with missing values and outliers in principal component analysis. doi
  52. (2006). Developing soft sensors using hybrid soft computing methodology: a neurofuzzy system based on rough set theory and genetic algorithms. doi
  53. (2004). Development of a hybrid pca-anfis measurement system for monitoring product quality in the coating industry. In: Systems, Man and Cybernetics, doi
  54. (2004). Development of a soft sensor for a batch distillation column using linear and nonlinear pls regression techniques. doi
  55. (1995). Emission monitoring using multivariate soft sensors. In: American Control Conference, doi
  56. (2002). Ensembles of learning machines. doi
  57. (2004). Estimating product composition profiles in batch distillation via partial least squares regression. doi
  58. (2000). Ethylene compressor monitoring using model-based pca. doi
  59. (2005). Evolving computational intelligence systems. In: doi
  60. (2006). evolving NFS OP crude oil distillation in refinery process Cont.
  61. (2001). Exploring process data. doi
  62. (1996). Fault detection and diagnosis with the help of fuzzy-logic and with application to a laboratory turbogenerator.
  63. (2008). Fault detection and isolation of an on-line analyzer for an ethylene cracking process. doi
  64. (2004). Flexible models with evolving structure. doi
  65. (2000). FPM+RBFN OP microbial population in a bioreactor Batch Rao
  66. (2003). Functional nodes in dynamic neural networks for bioprocess modelling.
  67. (1996). Fuzzy sets. World Scientific Series In
  68. Gabrys (2008b) MLPs ensemble OP industrial drier Cont.
  69. (2003). generalised ANN OP diacetyl concentration prediction Batch Lin et al.
  70. (2006). Genetic algorithms in classifier fusion. doi
  71. (2004). Hybrid intelligent systems for industrial data analysis. doi
  72. (2003). Hybrid model development methodology for industrial soft sensors. In: doi
  73. (2000). Hybrid modelling of biotechnological processes using neural networks. doi
  74. (2002). Identification of evolving fuzzy rule-based models. doi
  75. (2001). Industrial applications of soft computing: A review. doi
  76. (2002). Industrial use of multivariate statistical analysis for process monitoring and control. doi
  77. (2004). Integrated condition monitoring and control of fed76 batch fermentation processes. doi
  78. (2004). Integrated condition monitoring and control of fed76batch fermentation processes. doi
  79. (2002). Intelligent integrated plant operation system for six sigma. doi
  80. (1998). Joint diagnosis of process and sensor faults using principal component analysis. doi
  81. (2004). Learning hybrid neuro-fuzzy classifier models from data: To combine or not to combine? Fuzzy Sets and Systems 147, doi
  82. (1996). Learning in the presence of concept drift and hidden 75contexts. doi
  83. (1996). Learning in the presence of concept drift and hidden contexts. doi
  84. (2004). Learning with drift detection. doi
  85. (2008). Learnt topology gating artificial neural network. doi
  86. (2003). Local models for soft-sensors in a rougher flotation bank. doi
  87. (1997). Locally weighted learning. doi
  88. (2003). LS-SVM OP gasoline absorbing rate in FCC Cont.
  89. (2004). LS-SVM OP light diesel freezing point detection in FCC Cont.
  90. (2002). Missing data: Our view of the state of the art. doi
  91. (2005). MLP OP C4 and C5 concentration prediction in a debutanizer refinery process Cont.
  92. (2002). MLP OP sugar quality estimation Cont.
  93. (2000). MLP, FPM, eKF OP biomass estimation in a fermentation process Batch Qin
  94. (2002). MLP, RBFN, Hybrid (MLP/RBFN+FPM) OP biomass concentration prediction Batch Su et al.
  95. (2006). MLP, RBFN, SVR OP two simulated biochemical processes Batch Wang et al.
  96. (1996). Model predictive control of a slurry polymerisation reactor. doi
  97. (2005). Modeling and identification for multirate systems. doi
  98. (1998). Modelling and diagnostics of batch processes and analogous kinetic experiments. doi
  99. (1994). Monitoring batch processes using multiway principal component analysis. doi
  100. (2003). Monitoring of a sequencing batch reactor using adaptive multiblock principal component analysis. doi
  101. (2004). Monitoring of batch processes through state-space models. doi
  102. (1998). Monitoring the process of curing of epoxy/graphite fiber composites with a recurrent neural network as a soft sensor. doi
  103. (2006). MPLS PM process end point detection Batch Dunia and Qin
  104. (2001). Multi-and Megavariate Data Analysis: Principles and Applications. doi
  105. (1995). Multi-way partial least squares in monitoring batch processes. doi
  106. (1987). Multi-way principal components and pls analysis. doi
  107. (1995). Multivariate spc charts for monitoring batch processes. doi
  108. (1998). Multivariate statistical analysis of an emulsion batch process. doi
  109. (1996). Multiway calibration. multilinear pls. doi
  110. (2000). Neural and adaptive systems. doi
  111. (1996). Neural fuzzy systems: a neuro-fuzzy synergism to intelligent 69systems. Prentice-Hall, Inc. Upper Saddle River,
  112. (1996). Neural fuzzy systems: a neuro-fuzzy synergism to intelligent systems. doi
  113. (2000). Neural network based fault diagnosis using unmeasurable inputs. doi
  114. (1995). Neural network ensembles, cross validation and active learning.
  115. (1999). Neural networks based decision support in presence of uncertainties. doi
  116. (1997). Neural networks for intelligent sensors and control - practical issues and some solutions. Neural Systems for Control, 213234Neural Systems for Control. doi
  117. (1995). Neural Networks for Pattern Recognition. doi
  118. (2000). Neural networks for the identification and control of blast furnace hot metal quality. doi
  119. (1997). Neuro-fuzzy and soft computing. 66Prentice Hall Upper Saddle River, NJ, Prentice Hall Upper Saddle River,
  120. (2002). Neuro-fuzzy approach to processing inputs with missing values in pattern recognition problems. doi
  121. (2004). NFS OP penicillin production bioprocess Batch Wang and Rong
  122. (1995). NLPCA+NNPLS OP NOx prediction in exhaust gas Cont.
  123. (1998). Nonlinear inferential control for process applications. doi
  124. (1992). Nonlinear pls modeling using neural networks. doi
  125. (1996). Nonlinear principal component analysis–based on principal curves and neural networks. doi
  126. (1998). On combining classifiers. doi
  127. (1989). On the approximate realization of continuous mappings by neural networks. doi
  128. (2003). On-line batch process monitoring using a consecutively updated multiway principal component analysis model. doi
  129. (2002). On-line batch process monitoring using dynamic pca and dynamic pls models. doi
  130. (1990). On-line estimation and adaptive control of bioreactors. doi
  131. (1999). Online outlier detection and removal.
  132. (2002). Outliers in process modeling and identification. Control Systems Technology, doi
  133. (2003). Partial least squares (pls) regression. Encyclopedia of Social Sciences, Research Methods. Thousand Oaks (CA): Sage (2003)Encyclopedia of Social Sciences, Research Methods. Thousand Oaks (CA): Sage doi
  134. (1998). Particle size distribution soft-sensor for a grinding circuit. doi
  135. (1997). PCA OP, SFD air emission monitoring Cont.
  136. (2004). PCA SFD+PM air handling unit Cont.
  137. (2008). PCA, SOM, RBFN PM, PFD ethylene cracking process Cont.
  138. (2000). PCA/PLS+LWR OP toluene composition in a splitter column, diesel temperature in crude oil column Cont.
  139. (2006). PCA+PLS PM Lumber drying Batch Zhang and Lennox
  140. (2001). Pls-regression: a basic tool of chemometrics. doi
  141. (1999). Popular ensemble methods: An empirical study.
  142. (2000). Predicting the performance of soft sensors as a route to low cost automation. doi
  143. (2008). Principal component analysis for data containing outliers and missing elements. doi
  144. (2002). Principal Component Analysis. doi
  145. (2002). Process analysis and abnormal situation detection: from theory to practice. Control Systems Magazine, doi
  146. (1999). Process monitoring and modeling using the selforganizing map.
  147. (2005). Process monitoring approach using fast moving window pca. doi
  148. (2002). Product property and production rate control of styrene polymerization. doi
  149. (2005). PSO+MLP OP ethylene distillation column Cont. Kalos et al.
  150. (2006). Radial basis function neural networksbased modeling of the membrane separation process: Hydrogen recovery from refinery gases. doi
  151. (2006). Real-time monitoring of an industrial batch process. doi
  152. (2001). Recurrent Neural Networks for Prediction: Learning Algorithms, Architectures and Stability. doi
  153. (1997). Recursive exponentially weighted pls and its applications to adaptive control and prediction. doi
  154. (2000). Recursive pca for adaptive process monitoring. doi
  155. (1998). Recursive pls algorithms for adaptive data modeling. doi
  156. (1991). Regression on multivariate images: principal component regression for modeling, prediction and visual diagnostic tools. doi
  157. (1990). Regularization algorithms for learning that are equivalent to multilayer networks. doi
  158. (1997). RNN OP three simple simulated processes Cont.
  159. (2004). Robust inferential sensors based on ensemble of predictors generated by genetic programming. In: doi
  160. (1995). Robust principal components regression as a detection tool for outliers. doi
  161. (2002). Robust soft sensors based on integration of genetic programming, analytical neural networks, and support vector machines. In: Evolutionary Computation, doi
  162. (2000). RPCA PM rapid thermal annealing process Batch Nomikos and MacGregor (1995b) PCA PM polymerisation process Batch Rotem et al.
  163. (1998). RPLS OP research octane number prediction in a refinery process Cont.
  164. (1997). Self-organizing maps. doi
  165. (1997). Self-validating inferential sensors with application to air emission monitoring. doi
  166. (1996). Sensor fault identification and reconstruction using principal component analysis. In: doi
  167. (2004). Sensor fault identification based on timelagged pca in dynamic processes. doi
  168. (2005). Sensor-fault detection, diagnosis and estimation for centrifu74gal chiller systems using principal-component analysis method. doi
  169. (2006). Sequential adaptive fuzzy inference system (safis) for nonlinear system identification and prediction. Fuzzy Sets and Systems doi
  170. (1997). Soft sensing based on artificial neural network. In: doi
  171. (2004). Soft sensing modeling based on support vector machine and bayesian model selection. doi
  172. (2005). Soft sensing modeling via artificial neural network based on pso-alopex. doi
  173. (2000). Soft sensors development for on-line bioreactor state estimation. doi
  174. (2004). Soft sensors for on-line biomass measurements. doi
  175. (1999). Soft sensors for processing plants. Intelligent Processing and 65Manufacturing of Materials, doi
  176. (2005). Soft sensors for product quality monitoring in debutanizer distillation columns. doi
  177. (1993). Soft sensors for quality prediction in batch chemical pulping processes. In: Intelligent Control, doi
  178. (2006). Soft-sensor development for fed-batch bioreactors using support vector regression. doi
  179. (1997). Software sensors in bioprocess engineering.
  180. (1999). SOM PM, OP cont. pulp digester; steel production process; pulp and paper industry Cont.
  181. (1998). SRM (ARMAX) OP particle size estimation in a grinding plant Cont.
  182. (1992). Stacked generalization. doi
  183. (2004). Statistical and computational intelligence techniques for inferential model development: a comparative evaluation and a novel proposition for fusion. doi
  184. (2005). Statistical batch process monitoring using gray models. doi
  185. (1998). Statistical learning theory. doi
  186. (2005). Study on least squares support vector machines algorithm and its application. doi
  187. (1998). Subspace approach to multidimensional identification and reconstruction. doi
  188. (2000). System parameter estimation with input/output noisy data andmissing measurements. Signal Processing, doi
  189. (1979). The alopex process: Visual receptive fields by response feedback. doi
  190. (2001). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. doi
  191. (1931). The generalization of student’s ratio. doi
  192. (1993). The identification of multiple outliers. doi
  193. (1908). The probable error of a mean. doi
  194. (2004). TLPCA PFD, SFD polymerisation process Batch Wang and Cui
  195. (2006). Use of multivariate data analysis for lumber drying process monitoring and fault detection.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.