69 research outputs found
Online support vector machine application for model based fault detection and isolation of HVAC system
Abstract—Preventive maintenance plays an important role in Heating, Ventilation and Air Conditioning (HVAC) system. One cost effective strategy is the development of analytic fault detection and isolation (FDI) module by online monitoring the key variables of HAVC systems. This paper investigates realtime FDI for HAVC system by using online Support Vector Machine (SVM), by which we are able to train a FDI system with manageable complexity under real time working conditions. It is also proposed a new approach which allows us to detect unknown faults and updating the classifier by using these previously unknown faults. Based on the proposed approach, a semi unsupervised fault detection methodology has been developed for HVAC system
Training neural networks with structured noise improves classification and generalization
The beneficial role of noise in learning is nowadays a consolidated concept
in the field of artificial neural networks, suggesting that even biological
systems might take advantage of similar mechanisms to maximize their
performance. The training-with-noise algorithm proposed by Gardner and
collaborators is an emblematic example of a noise injection procedure in
recurrent networks, which are usually employed to model real neural systems. We
show how adding structure into noisy training data can substantially improve
the algorithm performance, allowing to approach perfect classification and
maximal basins of attraction. We also prove that the so-called Hebbian
unlearning rule coincides with the training-with-noise algorithm when noise is
maximal and data are fixed points of the network dynamics. A sampling scheme
for optimal noisy data is eventually proposed and implemented to outperform
both the training-with-noise and the Hebbian unlearning procedures.Comment: 21 pages, 17 figures, main text and appendice
Gaussian processes and SVM: Mean field results and leave-one-out
In this chapter, we elaborate on the well-known relationship between Gaussian processes (GP) and Support Vector Machines (SVM). Secondly, we present approximate solutions for two computational problems arising in GP and SVM. The first one is the calculation of the posterior mean for GP classifiers using a `naive' mean field approach. The second one is a leave-one-out estimator for the generalization error of SVM based on a linear response method. Simulation results on a benchmark dataset show similar performances for the GP mean field algorithm and the SVM algorithm. The approximate leave-one-out estimator is found to be in very good agreement with the exact leave-one-out error
Rejection-oriented learning without complete class information
Machine Learning is commonly used to support decision-making in numerous, diverse contexts. Its usefulness in this regard is unquestionable: there are complex systems built on the top of machine learning techniques whose descriptive and predictive capabilities go far beyond those of human beings. However, these systems still have limitations, whose analysis enable to estimate their applicability and confidence in various cases. This is interesting considering that abstention from the provision of a response is preferable to make a mistake in doing so. In the context of classification-like tasks, the indication of such inconclusive output is called rejection. The research which culminated in this thesis led to the conception, implementation and evaluation of rejection-oriented learning systems for two distinct tasks: open set recognition and data stream clustering. These system were derived from WiSARD artificial neural network, which had rejection modelling incorporated into its functioning. This text details and discuss such realizations. It also presents experimental results which allow assess the scientific and practical importance of the proposed state-of-the-art methodology.Aprendizado de Máquina é comumente usado para apoiar a tomada de decisão em numerosos e diversos contextos. Sua utilidade neste sentido é inquestionável: existem sistemas complexos baseados em técnicas de aprendizado de máquina cujas capacidades descritivas e preditivas vão muito além das dos seres humanos. Contudo, esses sistemas ainda possuem limitações, cuja análise permite estimar sua aplicabilidade e confiança em vários casos. Isto é interessante considerando que a abstenção da provisão de uma resposta é preferÃvel a cometer um equÃvoco ao realizar tal ação. No contexto de classificação e tarefas similares, a indicação desse resultado inconclusivo é chamada de rejeição. A pesquisa que culminou nesta tese proporcionou a concepção, implementação e avaliação de sistemas de aprendizado orientados `a rejeição para duas tarefas distintas: reconhecimento em cenário abertos e agrupamento de dados em fluxo contÃnuo. Estes sistemas foram derivados da rede neural artificial WiSARD, que teve a modelagem de rejeição incorporada a seu funcionamento. Este texto detalha e discute tais realizações. Ele também apresenta resultados experimentais que permitem avaliar a importância cientÃfica e prática da metodologia de ponta proposta
Automatic detection of pathological regions in medical images
Medical images are an essential tool in the daily clinical routine for the detection, diagnosis, and monitoring of diseases. Different imaging modalities such as magnetic resonance (MR) or X-ray imaging are used to visualize the manifestations of various diseases, providing physicians with valuable information. However, analyzing every single image by human experts is a tedious and laborious task. Deep learning methods have shown great potential to support this process, but many images are needed to train reliable neural networks. Besides the accuracy of the final method, the interpretability of the results is crucial for a deep learning method to be established. A fundamental problem in the medical field is the availability of sufficiently large datasets due to the variability of different imaging techniques and their configurations.
The aim of this thesis is the development of deep learning methods for the automatic identification of anomalous regions in medical images. Each method is tailored to the amount and type of available data. In the first step, we present a fully supervised segmentation method based on denoising diffusion models. This requires a large dataset with pixel-wise manual annotations of the pathological regions. Due to the implicit ensemble characteristic, our method provides uncertainty maps to allow interpretability of the model’s decisions. Manual pixel-wise annotations face the problems that they are prone to human bias, hard to obtain, and often even unavailable. Weakly supervised methods avoid these issues by only relying on image-level annotations. We present two different approaches based on generative models to generate pixel-wise anomaly maps using only image-level annotations, i.e., a generative adversarial network and a denoising diffusion model. Both perform image-to-image translation between a set of healthy and a set of diseased subjects. Pixel-wise anomaly maps can be obtained by computing the difference between the original image of the diseased subject and the synthetic image of its healthy representation. In an extension of the diffusion-based anomaly detection method, we present a flexible framework to solve various image-to-image translation tasks. With this method, we managed to change the size of tumors in MR images, and we were able to add realistic pathologies to images of healthy subjects.
Finally, we focus on a problem frequently occurring when working with MR images: If not enough data from one MR scanner are available, data from other scanners need to be considered. This multi-scanner setting introduces a bias between the datasets of different scanners, limiting the performance of deep learning models. We present a regularization strategy on the model’s latent space to overcome the problems raised by this multi-site setting
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data
With the increasing volume of high-frequency data in the information age,
both challenges and opportunities arise in the prediction of stock volatility.
On one hand, the outcome of prediction using tradition method combining stock
technical and macroeconomic indicators still leaves room for improvement; on
the other hand, macroeconomic indicators and peoples' search record on those
search engines affecting their interested topics will intuitively have an
impact on the stock volatility. For the convenience of assessment of the
influence of these indicators, macroeconomic indicators and stock technical
indicators are then grouped into objective factors, while Baidu search indices
implying people's interested topics are defined as subjective factors. To align
different frequency data, we introduce GARCH-MIDAS model. After mixing all the
above data, we then feed them into Transformer model as part of the training
data. Our experiments show that this model outperforms the baselines in terms
of mean square error. The adaption of both types of data under Transformer
model significantly reduces the mean square error from 1.00 to 0.86.Comment: Accepted by the 7th APWeb-WAIM International Joint Conference on Web
and Big Data. (APWeb 2023
- …