Search CORE

37,005 research outputs found

Wind turbine condition monitoring strategy through multiway PCA and multivariate inference

Author: Pozo Montero Francesc
Salgado Óscar
Vidal Seguí Yolanda
Publication venue: 'MDPI AG'
Publication date: 01/01/2018
Field of study

This article states a condition monitoring strategy for wind turbines using a statistical data-driven modeling approach by means of supervisory control and data acquisition (SCADA) data. Initially, a baseline data-based model is obtained from the healthy wind turbine by means of multiway principal component analysis (MPCA). Then, when the wind turbine is monitorized, new data is acquired and projected into the baseline MPCA model space. The acquired SCADA data are treated as a random process given the random nature of the turbulent wind. The objective is to decide if the multivariate distribution that is obtained from the wind turbine to be analyzed (healthy or not) is related to the baseline one. To achieve this goal, a test for the equality of population means is performed. Finally, the results of the test can determine that the hypothesis is rejected (and the wind turbine is faulty) or that there is no evidence to suggest that the two means are different, so the wind turbine can be considered as healthy. The methodology is evaluated on a wind turbine fault detection benchmark that uses a 5 MW high-fidelity wind turbine model and a set of eight realistic fault scenarios. It is noteworthy that the results, for the presented methodology, show that for a wide range of significance, a in [1%, 13%], the percentage of correct decisions is kept at 100%; thus it is a promising tool for real-time wind turbine condition monitoring.Peer ReviewedPostprint (published version

Multidisciplinary Digital Publishing Institute

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Directory of Open Access Journals

Enrichment Procedures for Soft Clusters: A Statistical Test and its Applications

Author: Phillips Rhonda D.
Ramakrishnan Naren
Watson Layne T.
Wynne Randolph H.
Publication venue
Publication date: 01/02/2010
Field of study

Clusters, typically mined by modeling locality of attribute spaces, are often evaluated for their ability to demonstrate ‘enrichment’ of categorical features. A cluster enrichment procedure evaluates the membership of a cluster for significant representation in pre-defined categories of interest. While classical enrichment procedures assume a hard clustering deﬁnition, in this paper we introduce a new statistical test that computes enrichments for soft clusters. We demonstrate an application of this test in reﬁning and evaluating soft clusters for classification of remotely sensed images

Computer Science Technical Reports @Virginia Tech

Semantic Information G Theory and Logical Bayesian Inference for Machine Learning

Author: Lu Chenguang
Publication venue
Publication date: 01/01/2019
Field of study

An important problem with machine learning is that when label number n\u3e2, it is very difficult to construct and optimize a group of learning functions, and we wish that optimized learning functions are still useful when prior distribution P(x) (where x is an instance) is changed. To resolve this problem, the semantic information G theory, Logical Bayesian Inference (LBI), and a group of Channel Matching (CM) algorithms together form a systematic solution. MultilabelMultilabel A semantic channel in the G theory consists of a group of truth functions or membership functions. In comparison with likelihood functions, Bayesian posteriors, and Logistic functions used by popular methods, membership functions can be more conveniently used as learning functions without the above problem. In Logical Bayesian Inference (LBI), every label’s learning is independent. For Multilabel learning, we can directly obtain a group of optimized membership functions from a big enough sample with labels, without preparing different samples for different labels. A group of Channel Matching (CM) algorithms are developed for machine learning. For the Maximum Mutual Information (MMI) classification of three classes with Gaussian distributions on a two-dimensional feature space, 2-3 iterations can make mutual information between three classes and three labels surpass 99% of the MMI for most initial partitions. For mixture models, the Expectation-Maxmization (EM) algorithm is improved and becomes the CM-EM algorithm, which can outperform the EM algorithm when mixture ratios are imbalanced, or local convergence exists. The CM iteration algorithm needs to combine neural networks for MMI classifications on high-dimensional feature spaces. LBI needs further studies for the unification of statistics and logic

PhilPapers

Multidisciplinary Digital Publishing Institute

Comparison of Thematic Maps Using Symbolic Entropy

Author: Antonio Paez
Fernando A LÃ³pez
Manuel Ruiz
Publication venue
Publication date
Field of study

Comparison of thematic maps is an important task in a number of disciplines. Map comparison has traditionally been conducted using cell-by-cell agreement indicators, such as the Kappa measure. More recently, other methods have been proposed that take into account not only spatially coincident cells in two maps, but also their surroundings or the spatial structure of their differences. The objective of this paper is to propose a framework for map comparison that considers 1) the patterns of spatial association in two maps, in other words, the map elements in their surroundings; 2) the equivalence of those patterns; and 3) the independence of patterns between maps. Two new statistics for the spatial analysis of qualitative data are introduced. These statistics are based on the symbolic entropy of the maps, and function as measures of map compositional equivalence and independence. As well, all inferential elements to conduct hypothesis testing are developed. The framework is illustrated using real and synthetic maps. Key word: Thematic maps, map comparison, qualitative variables, spatial association, symbolic entropy, hypothesis tests

Research Papers in Economics