67,711 research outputs found

    Representation Learning for Anomaly Detection: From the Aspects of Data Views and Optimization

    Full text link
    University of Technology Sydney. Faculty of Engineering and Information Technology.Anomaly detection is a challenging task in realistic applications. However, there exist major issues for anomaly detection: (1) While the traditional problem setting focuses on data of single view, we have to deal with data of multiple views in many practical scenarios. Current methods often rely on assumptions on data distribution, which limits their flexibility and application. There is a lack of more effective and flexible methods for both semi-supervised and unsupervised multi-view anomaly detection. (2) Deep neural networks (DNN) have been widely applied for detecting anomalies. However, the mainstream optimization and learning strategies in DNN makes it easily fit both normal and anomalous data, resulting in an unsatisfactory performance and less reliability. There is a need for more advanced and reliable optimization framework for anomaly detection. In this thesis, we propose innovative deep representation learning models to tackle anomaly detection problems from aspects of multi-view data and model optimization. We first introduce related work and literature review. In following chapters, we investigate semi-supervised multi-view anomaly detection via variational generative model, unsupervised multi-view anomaly detection by exploring latent spaces, and advanced optimization strategy for general unsupervised anomaly detection respectively

    Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective

    Get PDF
    This paper takes a problem-oriented perspective and presents a comprehensive review of transfer learning methods, both shallow and deep, for cross-dataset visual recognition. Specifically, it categorises the cross-dataset recognition into seventeen problems based on a set of carefully chosen data and label attributes. Such a problem-oriented taxonomy has allowed us to examine how different transfer learning approaches tackle each problem and how well each problem has been researched to date. The comprehensive problem-oriented review of the advances in transfer learning with respect to the problem has not only revealed the challenges in transfer learning for visual recognition, but also the problems (e.g. eight of the seventeen problems) that have been scarcely studied. This survey not only presents an up-to-date technical review for researchers, but also a systematic approach and a reference for a machine learning practitioner to categorise a real problem and to look up for a possible solution accordingly

    Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data

    Full text link
    There are threefold challenges in emotion recognition. First, it is difficult to recognize human's emotional states only considering a single modality. Second, it is expensive to manually annotate the emotional data. Third, emotional data often suffers from missing modalities due to unforeseeable sensor malfunction or configuration issues. In this paper, we address all these problems under a novel multi-view deep generative framework. Specifically, we propose to model the statistical relationships of multi-modality emotional data using multiple modality-specific generative networks with a shared latent space. By imposing a Gaussian mixture assumption on the posterior approximation of the shared latent variables, our framework can learn the joint deep representation from multiple modalities and evaluate the importance of each modality simultaneously. To solve the labeled-data-scarcity problem, we extend our multi-view model to semi-supervised learning scenario by casting the semi-supervised classification problem as a specialized missing data imputation task. To address the missing-modality problem, we further extend our semi-supervised multi-view model to deal with incomplete data, where a missing view is treated as a latent variable and integrated out during inference. This way, the proposed overall framework can utilize all available (both labeled and unlabeled, as well as both complete and incomplete) data to improve its generalization ability. The experiments conducted on two real multi-modal emotion datasets demonstrated the superiority of our framework.Comment: arXiv admin note: text overlap with arXiv:1704.07548, 2018 ACM Multimedia Conference (MM'18
    • …
    corecore