26,952 research outputs found
Data-driven Soft Sensors in the Process Industry
In the last two decades Soft Sensors established themselves as a valuable alternative to the traditional means for the acquisition of critical process variables, process monitoring and other tasks which are related to process control. This paper discusses characteristics of the process industry data which are critical for the development of data-driven Soft Sensors. These characteristics are common to a large number of process industry fields, like the chemical industry, bioprocess industry, steel industry, etc. The focus of this work is put on the data-driven Soft Sensors because of their growing popularity, already demonstrated usefulness and huge, though yet not completely realised, potential. A comprehensive selection of case studies covering the three most important Soft Sensor application fields, a general introduction to the most popular Soft Sensor modelling techniques as well as a discussion of some open issues in the Soft Sensor development and maintenance and their possible solutions are the main contributions of this work
Use of easy measurable phenotypic traits as a complementary approach to evaluate the population structure and diversity in a high heterozygous panel of tetraploid clones and cultivars
Diversity in crops is fundamental for plant breeding efforts. An accurate assessment of genetic diversity, using molecular markers, such as single nucleotide polymorphism (SNP), must be able to reveal the structure of the population under study. A characterization of population structure using easy measurable phenotypic traits could be a preliminary and low-cost approach to elucidate the genetic structure of a population. A potato population of 183 genotypes was evaluated using 4859 high-quality SNPs and 19 phenotypic traits commonly recorded in potato breeding programs. A Bayesian approach, Minimum Spanning Tree (MST) and diversity estimator, as well as multivariate analysis based on phenotypic traits, were adopted to assess the population structure. Results: Analysis based on molecular markers showed groups linked to the phylogenetic relationship among the germplasm as well as the link with the breeding program that provided the material. Diversity estimators consistently structured the population according to a priori group estimation. The phenotypic traits only discriminated main groups with contrasting characteristics, as different subspecies, ploidy level or membership in a breeding program, but were not able to discriminate within groups. A joint molecular and phenotypic characterization analysis discriminated groups based on phenotypic classification, taxonomic category, provenance source of genotypes and genetic background. Conclusions: This paper shows the significant level of diversity existing in a parental population of potato as well as the putative phylogenetic relationships among the genotypes. The use of easily measurable phenotypic traits among highly contrasting genotypes could be a reasonable approach to estimate population structure in the initial phases of a potato breeding program.Fil: Tagliotti, Martin Enrique. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mar del Plata; Argentina. Instituto Nacional de Tecnología Agropecuaria. Centro Regional Buenos Aires Sur. Estación Experimental Agropecuaria Balcarce; ArgentinaFil: Deperi, Sofía Irene. Instituto Nacional de Tecnología Agropecuaria. Centro Regional Buenos Aires Sur. Estación Experimental Agropecuaria Balcarce; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mar del Plata; ArgentinaFil: Bedogni, María Cecilia. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Mar del Plata; Argentina. Instituto Nacional de Tecnología Agropecuaria. Centro Regional Buenos Aires Sur. Estación Experimental Agropecuaria Balcarce; ArgentinaFil: Zhang, Ruofang. Inner Mongolia University; ChileFil: Manrique Carpintero, Norma C.. Michigan State University; Estados UnidosFil: Coombs, Joseph. Michigan State University; Estados UnidosFil: Douches, David. Michigan State University; Estados UnidosFil: Huarte, Marcelo Atilio. Instituto Nacional de Tecnología Agropecuaria. Centro Regional Buenos Aires Sur. Estación Experimental Agropecuaria Balcarce; Argentin
Application of Computational Intelligence Techniques to Process Industry Problems
In the last two decades there has been a large progress in the computational
intelligence research field. The fruits of the effort spent on the research in the discussed
field are powerful techniques for pattern recognition, data mining, data modelling, etc.
These techniques achieve high performance on traditional data sets like the UCI
machine learning database. Unfortunately, this kind of data sources usually represent
clean data without any problems like data outliers, missing values, feature co-linearity,
etc. common to real-life industrial data. The presence of faulty data samples can have
very harmful effects on the models, for example if presented during the training of the
models, it can either cause sub-optimal performance of the trained model or in the worst
case destroy the so far learnt knowledge of the model. For these reasons the application
of present modelling techniques to industrial problems has developed into a research
field on its own. Based on the discussion of the properties and issues of the data and the
state-of-the-art modelling techniques in the process industry, in this paper a novel
unified approach to the development of predictive models in the process industry is
presented
Cosmological Density and Power Spectrum from Peculiar Velocities: Nonlinear Corrections and PCA
We allow for nonlinear effects in the likelihood analysis of galaxy peculiar
velocities, and obtain ~35%-lower values for the cosmological density parameter
Om and the amplitude of mass-density fluctuations. The power spectrum in the
linear regime is assumed to be a flat LCDM model (h=0.65, n=1, COBE) with only
Om as a free parameter. Since the likelihood is driven by the nonlinear regime,
we "break" the power spectrum at k_b=0.2 h/Mpc and fit a power law at k>k_b.
This allows for independent matching of the nonlinear behavior and an unbiased
fit in the linear regime. The analysis assumes Gaussian fluctuations and
errors, and a linear relation between velocity and density. Tests using proper
mock catalogs demonstrate a reduced bias and a better fit. We find for the
Mark3 and SFI data Om_m=0.32+-0.06 and 0.37+-0.09 respectively, with
sigma_8*Om^0.6 = 0.49+-0.06 and 0.63+-0.08, in agreement with constraints from
other data. The quoted 90% errors include cosmic variance. The improvement in
likelihood due to the nonlinear correction is very significant for Mark3 and
moderately so for SFI. When allowing deviations from LCDM, we find an
indication for a wiggle in the power spectrum: an excess near k=0.05 and a
deficiency at k=0.1 (cold flow). This may be related to the wiggle seen in the
power spectrum from redshift surveys and the second peak in the CMB anisotropy.
A chi^2 test applied to modes of a Principal Component Analysis (PCA) shows
that the nonlinear procedure improves the goodness of fit and reduces a spatial
gradient of concern in the linear analysis. The PCA allows addressing spatial
features of the data and fine-tuning the theoretical and error models. It shows
that the models used are appropriate for the cosmological parameter estimation
performed. We address the potential for optimal data compression using PCA.Comment: 18 pages, LaTex, uses emulateapj.sty, ApJ in press (August 10, 2001),
improvements to text and figures, updated reference
Self-Consistent Asset Pricing Models
We discuss the foundations of factor or regression models in the light of the
self-consistency condition that the market portfolio (and more generally the
risk factors) is (are) constituted of the assets whose returns it is (they are)
supposed to explain. As already reported in several articles, self-consistency
implies correlations between the return disturbances. As a consequence, the
alpha's and beta's of the factor model are unobservable. Self-consistency leads
to renormalized beta's with zero effective alpha's, which are observable with
standard OLS regressions. Analytical derivations and numerical simulations show
that, for arbitrary choices of the proxy which are different from the true
market portfolio, a modified linear regression holds with a non-zero value
at the origin between an asset 's return and the proxy's return.
Self-consistency also introduces ``orthogonality'' and ``normality'' conditions
linking the beta's, alpha's (as well as the residuals) and the weights of the
proxy portfolio. Two diagnostics based on these orthogonality and normality
conditions are implemented on a basket of 323 assets which have been components
of the S&P500 in the period from Jan. 1990 to Feb. 2005. These two diagnostics
show interesting departures from dynamical self-consistency starting about 2
years before the end of the Internet bubble. Finally, the factor decomposition
with the self-consistency condition derives a risk-factor decomposition in the
multi-factor case which is identical to the principal components analysis
(PCA), thus providing a direct link between model-driven and data-driven
constructions of risk factors.Comment: 36 pages with 8 figures. large version with 6 appendices for the
Proceedings of the 5th International Conference APFS (Applications of Physics
in Financial Analysis), June 29-July 1, 2006, Torin
Gating Artificial Neural Network Based Soft Sensor
This work proposes a novel approach to Soft Sensor modelling,
where the Soft Sensor is built by a set of experts which are artificial
neural networks with randomly generated topology. For each of
the experts a meta neural network is trained, the gating Artificial Neural
Network. The role of the gating network is to learn the performance of the
experts in dependency on the input data samples. The final prediction
of the Soft Sensor is a weighted sum of the individual experts predictions.
The proposed meta-learning method is evaluated on two different
process industry data sets
- …