395 research outputs found

    Generalized Multi-manifold Graph Ensemble Embedding for Multi-View Dimensionality Reduction

    Get PDF
    In this paper, we propose a new dimension reduction (DR) algorithm called ensemble graph-based locality preserving projections (EGLPP); to overcome the neighborhood size k sensitivity in locally preserving projections (LPP). EGLPP constructs a homogeneous ensemble of adjacency graphs by varying neighborhood size k and finally uses the integrated embedded graph to optimize the low-dimensional projections. Furthermore, to appropriately handle the intrinsic geometrical structure of the multi-view data and overcome the dimensionality curse, we propose a generalized multi-manifold graph ensemble embedding framework (MLGEE). MLGEE aims to utilize multi-manifold graphs for the adjacency estimation with automatically weight each manifold to derive the integrated heterogeneous graph. Experimental results on various computer vision databases verify the effectiveness of proposed EGLPP and MLGEE over existing comparative DR methods

    On the asymptotics of dimers on tori

    Full text link
    We study asymptotics of the dimer model on large toric graphs. Let L\mathbb L be a weighted Z2\mathbb{Z}^2-periodic planar graph, and let Z2E\mathbb{Z}^2 E be a large-index sublattice of Z2\mathbb{Z}^2. For L\mathbb L bipartite we show that the dimer partition function on the quotient L/(Z2E)\mathbb{L}/(\mathbb{Z}^2 E) has the asymptotic expansion exp[Af0+fsc+o(1)]\exp[A f_0 + \text{fsc} + o(1)], where AA is the area of L/(Z2E)\mathbb{L}/(\mathbb{Z}^2 E), f0f_0 is the free energy density in the bulk, and fsc\text{fsc} is a finite-size correction term depending only on the conformal shape of the domain together with some parity-type information. Assuming a conjectural condition on the zero locus of the dimer characteristic polynomial, we show that an analogous expansion holds for L\mathbb{L} non-bipartite. The functional form of the finite-size correction differs between the two classes, but is universal within each class. Our calculations yield new information concerning the distribution of the number of loops winding around the torus in the associated double-dimer models.Comment: 48 pages, 18 figure

    Topology of real cubic fourfolds

    Get PDF
    A solution to the problem of topological classification of real cubic fourfolds is presented. It is shown that the real locus of a real non-singular cubic fourfold is obtained from a projective 4-space either by adding several trivial one- and two-handles, or by adding a spherical connected component.Comment: 28 pages, 6 figure

    Noisy multi-label semi-supervised dimensionality reduction

    Get PDF
    Noisy labeled data represent a rich source of information that often are easily accessible and cheap to obtain, but label noise might also have many negative consequences if not accounted for. How to fully utilize noisy labels has been studied extensively within the framework of standard supervised machine learning over a period of several decades. However, very little research has been conducted on solving the challenge posed by noisy labels in non-standard settings. This includes situations where only a fraction of the samples are labeled (semi-supervised) and each high-dimensional sample is associated with multiple labels. In this work, we present a novel semi-supervised and multi-label dimensionality reduction method that effectively utilizes information from both noisy multi-labels and unlabeled data. With the proposed Noisy multi-label semi-supervised dimensionality reduction (NMLSDR) method, the noisy multi-labels are denoised and unlabeled data are labeled simultaneously via a specially designed label propagation algorithm. NMLSDR then learns a projection matrix for reducing the dimensionality by maximizing the dependence between the enlarged and denoised multi-label space and the features in the projected space. Extensive experiments on synthetic data, benchmark datasets, as well as a real-world case study, demonstrate the effectiveness of the proposed algorithm and show that it outperforms state-of-the-art multi-label feature extraction algorithms.Comment: 38 page

    Spectral Graph Embedding for Dimension Reduction in Financial Risk Assessment

    Get PDF
    The economic downturn in recent years has had a significant negative impact on corporates performance. In the last two years, as in the last years of 2010s, many companies have been influenced by the economic conditions and some have gone bankrupt. This has led to an increase in companies' financial risk. One of the significant branches of financial risk is the emph{company's credit risk}. Lenders and investors attach great importance to determining a company's credit risk when granting a credit facility. Credit risk means the possibility of default on repayment of facilities received by a company. There are various models for assessing credit risk using statistical models or machine learning. In this paper, we will investigate the machine learning task of the binary classification of firms into bankrupt and healthy based on the emph{spectral graph theory}. We first construct an emph{adjacency graph} from a list of firms with their corresponding emph{feature vectors}. Next, we first embed this graph into a one-dimensional Euclidean space and then into a two dimensional Euclidean space to obtain two lower-dimensional representations of the original data points. Finally, we apply the emph{support vector machine} and the emph{multi-layer perceptron} neural network techniques to proceed binary emph{node classification}. The results of the proposed method on the given dataset (selected firms of Tehran stock exchange market) show a comparative advantage over PCA method of emph{dimension reduction}. Finally, we conclude the paper with some discussions on further research directions

    Computation of Heterogeneous Object Co-embeddings from Relational Measurements

    Get PDF
    Dimensionality reduction and data embedding methods generate low dimensional representations of a single type of homogeneous data objects. In this work, we examine the problem of generating co-embeddings or pattern representations from two different types of objects within a joint common space of controlled dimensionality, where the only available information is assumed to be a set of pairwise relations or similarities between instances of the two groups. We propose a new method that models the embedding of each object type symmetrically to the other type, subject to flexible scale constraints and weighting parameters. The embedding generation relies on an efficient optimization dispatched using matrix decomposition, that is also extended to support multidimensional co-embeddings. We also propose a scheme of heuristically reducing the parameters of the model, and a simple way of measuring the conformity between the original object relations and the ones re-estimated from the co-embeddings, in order to achieve model selection by identifying the optimal model parameters with a simple search procedure. The capabilities of the proposed method are demonstrated with multiple synthetic and real-world datasets from the text mining domain. The experimental results and comparative analyses indicate that the proposed algorithm outperforms existing methods for co-embedding generation
    corecore