318 research outputs found

    Kernelized Supervised Dictionary Learning

    Get PDF
    The representation of a signal using a learned dictionary instead of predefined operators, such as wavelets, has led to state-of-the-art results in various applications such as denoising, texture analysis, and face recognition. The area of dictionary learning is closely associated with sparse representation, which means that the signal is represented using few atoms in the dictionary. Despite recent advances in the computation of a dictionary using fast algorithms such as K-SVD, online learning, and cyclic coordinate descent, which make the computation of a dictionary from millions of data samples computationally feasible, the dictionary is mainly computed using unsupervised approaches such as k-means. These approaches learn the dictionary by minimizing the reconstruction error without taking into account the category information, which is not optimal in classification tasks. In this thesis, we propose a supervised dictionary learning (SDL) approach by incorporating information on class labels into the learning of the dictionary. To this end, we propose to learn the dictionary in a space where the dependency between the signals and their corresponding labels is maximized. To maximize this dependency, the recently-introduced Hilbert Schmidt independence criterion (HSIC) is used. The learned dictionary is compact and has closed form; the proposed approach is fast. We show that it outperforms other unsupervised and supervised dictionary learning approaches in the literature on real-world data. Moreover, the proposed SDL approach has as its main advantage that it can be easily kernelized, particularly by incorporating a data-driven kernel such as a compression-based kernel, into the formulation. In this thesis, we propose a novel compression-based (dis)similarity measure. The proposed measure utilizes a 2D MPEG-1 encoder, which takes into consideration the spatial locality and connectivity of pixels in the images. The proposed formulation has been carefully designed based on MPEG encoder functionality. To this end, by design, it solely uses P-frame coding to find the (dis)similarity among patches/images. We show that the proposed measure works properly on both small and large patch sizes on textures. Experimental results show that by incorporating the proposed measure as a kernel into our SDL, it significantly improves the performance of a supervised pixel-based texture classification on Brodatz and outdoor images compared to other compression-based dissimilarity measures, as well as state-of-the-art SDL methods. It also improves the computation speed by about 40% compared to its closest rival. Eventually, we have extended the proposed SDL to multiview learning, where more than one representation is available on a dataset. We propose two different multiview approaches: one fusing the feature sets in the original space and then learning the dictionary and sparse coefficients on the fused set; and the other by learning one dictionary and the corresponding coefficients in each view separately, and then fusing the representations in the space of the dictionaries learned. We will show that the proposed multiview approaches benefit from the complementary information in multiple views, and investigate the relative performance of these approaches in the application of emotion recognition

    Advances in nonnegative matrix factorization with application on data clustering.

    Get PDF
    Clustering is an important direction in many ļ¬elds, e.g., machine learning, data mining and computer vision. It aims to divide data into groups (clusters) for the purposes of summarization or improved understanding. With the rapid development of new technology, high-dimensional data become very common in many real world applications, such as satellite returned large number of images, robot received real-time video streaming, large-scale text database and the mass of information on the social networks (i.e., Facebook, twitter), etc, however, most existing clustering approaches are heavily restricted by the large number of features, and tend to be ineļ¬ƒcient and even infeasible. In this thesis, we focus on ļ¬nding an optimal low dimensional representation of high-dimensional data, based nonnegative matrix factorization (NMF) framework, for better clustering. Speciļ¬cally, there are three methods as follows: - Multiple Components Based Representation Learning Real data are usually complex and contain various components. For example, face images have expressions and genders. Each component mainly reļ¬‚ects one aspect of data and provides information others do not have. Therefore, exploring the semantic information of multiple components as well as the diversity among them is of great beneļ¬t to understand data comprehensively and in-depth. To this end, we propose a novel multi-component nonnegative matrix factorization. Instead of seeking for only one representation of data, our approach learns multiple representations simultaneously, with the help of the Hilbert Schmidt Independence Criterion (HSIC) as a diversity term. HSIC explores the diverse information among the representations, where each representation corresponds to a component. By integrating the multiple representations, a more comprehensive representation is then established. Extensive experimental results on real-world datasets have shown that MCNMF not only achieves more accurate performance over the state-of-the-arts using the aggregated representation, but also interprets data from diļ¬€erent aspects with the multiple representations, which is beyond what current NMFs can oļ¬€er. - Ordered Structure Preserving Representation Learning Real-world applications often process data, such as motion sequences and video clips, are with ordered structure, i.e., consecutive neighbouring data samples are very likely share similar features unless a sudden change occurs. Therefore, traditional NMF assumes the data samples and features to be independently distributed, making it not proper for the analysis of such data. To overcome this limitation, a novel NMF approach is proposed to take full advantage of the ordered nature embedded in the sequential data to improve the accuracy of data representation. With a L2,1-norm based neighbour penalty term, ORNMF enforces the similarity of neighbouring data. ORNMF also adopts the L2,1-norm based loss function to improve its robustness against noises and outliers. Moreover, ORNMF can ļ¬nd the cluster boundaries and get the number of clusters without the number of clusters to be given beforehand. A new iterative up- dating optimization algorithm is derived to solve ORNMFā€™s objective function. The proofs of the convergence and correctness of the scheme are also presented. Experiments on both synthetic and real-world datasets have demonstrated the eļ¬€ectiveness of ORNMF. - Diversity Enhanced Multi-view Representation Learning Multi-view learning aims to explore the correlations of diļ¬€erent information, such as diļ¬€erent features or modalities to boost the performance of data analysis. Multi-view data are very common in many real world applications because data is often collected from diverse domains or obtained from diļ¬€erent feature extractors. For example, color and texture information can be utilized as diļ¬€erent kinds of features in images and videos. Web pages are also able to be represented using the multi-view features based on text and hyperlinks. Taken alone, these views will often be deļ¬cient or incomplete because diļ¬€erent views describe distinct perspectives of data. Therefore, we propose a Diverse Multi-view NMF approach to explore diverse information among multi-view representations for more comprehensive learning. With a novel diversity regularization term, DiNMF explicitly enforces the orthogonality of diļ¬€erent data representations. Importantly, DiNMF converges linearly and scales well with large-scale data. By taking into account the manifold structures, we further extend the approach under a graph-based model to preserve the locally geometrical structure of the manifolds for multi-view setting. Compared to other multi-view NMF methods, the enhanced diversity of both approaches reduce the redundancy between the multi-view representations, and improve the accuracy of the clustering results. - Constrained Multi-View Representation Learning To incorporate prior information for learning accurately, we propose a novel semi- supervised multi-view NMF approach, which considers both the label constraints as well as the multi-view consistence simultaneously. In particular, the approach guarantees that data sharing the same label will have the same new representation and be mapped into the same class in the low-dimensional space regardless whether they come from the same view. Moreover, diļ¬€erent from current NMF- based multi-view clustering methods that require the weight factor of each view to be speciļ¬ed individually, we introduce a single parameter to control the distribution of weighting factors for NMF-based multi-view clustering. Consequently, the weight factor of each view can be assigned automatically depending on the dissimilarity between each new representation matrix and the consensus matrix. Besides, Using the structured sparsity-inducing, L2,1-norm, our method is robust against noises and hence can achieve more stable clustering results

    Data fusion and matching by maximizing statistical dependencies

    Get PDF
    The core aim of machine learning is to make a computer program learn from the experience. Learning from data is usually defined as a task of learning regularities or patterns in data in order to extract useful information, or to learn the underlying concept. An important sub-field of machine learning is called multi-view learning where the task is to learn from multiple data sets or views describing the same underlying concept. A typical example of such scenario would be to study a biological concept using several biological measurements like gene expression, protein expression and metabolic profiles, or to classify web pages based on their content and the contents of their hyperlinks. In this thesis, novel problem formulations and methods for multi-view learning are presented. The contributions include a linear data fusion approach during exploratory data analysis, a new measure to evaluate different kinds of representations for textual data, and an extension of multi-view learning for novel scenarios where the correspondence of samples in the different views or data sets is not known in advance. In order to infer the one-to-one correspondence of samples between two views, a novel concept of multi-view matching is proposed. The matching algorithm is completely data-driven and is demonstrated in several applications such as matching of metabolites between humans and mice, and matching of sentences between documents in two languages.Koneoppimisessa pyritƤƤn luomaan tietokoneohjelmia, jotka oppivat kokemuksen kautta. TehtƤvƤnƤ on usein oppia tietoaineistoista sƤƤnnƶnmukaisuuksia joiden avulla saadaan uutta tietoa aineiston taustalla olevasta ilmiƶstƤ ja voidaan ymmƤrtƤƤ ilmiƶtƤ paremmin. ErƤs keskeinen koneoppimisen alahaara kƤsittelee oppimista useita samaa ilmiƶtƤ kƤsitteleviƤ tietoaineistoja yhdistelemƤllƤ. Tavoitteena voi olla esimerkiksi solutason biologisen ilmiƶn ymmƤrtƤminen tarkastelemalla geenien aktiivisuusmittauksia, proteiinien konsentraatioita ja metabolista aktiivisuutta samanaikaisesti. Toisena esimerkkinƤ verkkosivuja voidaan luokitella samanaikaisesti sekƤ niiden tekstisisƤllƶn ettƤ hyperlinkkirakenteen perusteella. TƤssƤ vƤitƶskirjassa esitellƤƤn uusia periaatteita ja menetelmiƤ useiden tietolƤhteiden yhdistelemiseen. Tyƶn pƤƤtuloksina esitellƤƤn lineaarinen tietoaineistojen yhdistelemismenetelmƤ tutkivaan analysiin, uusi menetelmƤ tekstiaineistojen erilaisten esitystapojen vertailuun sekƤ uusi yhdistelemisperiaate tilanteisiin joissa aineistojen nƤytteiden vastaavuutta toisiinsa ei tunneta ennalta. TyƶssƤ esitetƤƤn kuinka vastaavuus voidaan oppia tietoaineistoista itsestƤƤn, ilman ulkopuolista ohjausta. Uutta menetelmƤƤ sovelletaan tyƶssƤ esimerkiksi hakemaan vastaavuuksia ihmisten ja hiirten metaboliamittauksista sekƤ etsimƤƤn samaa merkitseviƤ lauseita kahdella eri kielellƤ kirjoitetuista teksteistƤ

    PSA 2016

    Get PDF
    These preprints were automatically compiled into a PDF from the collection of papers deposited in PhilSci-Archive in conjunction with the PSA 2016

    Pattern Recognition

    Get PDF
    Pattern recognition is a very wide research field. It involves factors as diverse as sensors, feature extraction, pattern classification, decision fusion, applications and others. The signals processed are commonly one, two or three dimensional, the processing is done in real- time or takes hours and days, some systems look for one narrow object class, others search huge databases for entries with at least a small amount of similarity. No single person can claim expertise across the whole field, which develops rapidly, updates its paradigms and comprehends several philosophical approaches. This book reflects this diversity by presenting a selection of recent developments within the area of pattern recognition and related fields. It covers theoretical advances in classification and feature extraction as well as application-oriented works. Authors of these 25 works present and advocate recent achievements of their research related to the field of pattern recognition

    Advances in Eating Disorders

    Get PDF
    Eating disorders (ED) are a group of mental disorders characterized by an altered food intake and the presence of inappropriate behaviors and thoughts about weight and shape. All EDs lead to physical and psychosocial functioning impairments in the patients which, in turn, may contribute to the persistence of the disease. The severity of EDs has been highlighted by their chronicity, medical complications, comorbidity, and high rates of mortality. Therefore, to address this important health issue, the current Special Issue collected 21 articles (i.e., three reviews and 18 research articles) focusing on the most recent and relevant scientific findings regarding advances in ED, such as genetic and epigenetic factors, biomarkers, comorbidity, clinical phenotypes, neurocognition, treatment predictors, and treatment models and therapeutic targets. Altogether, we believe that the articles contained in this Special Issue have largely achieved the initial objective of providing increased knowledge about the pathogenesis, the risk factors, the maintenance factors, and the most appropriate treatments tools for ED
    • ā€¦
    corecore