91,273 research outputs found

    Dealing with uncertainty: A rough-set-based approach with the background of classical logic

    Get PDF
    The representative-based approximation has been widely studied in rough set theory. Hence, rough set approximations can be defined by the system of representatives, which plays a crucial role in set approximation. In the authors’ previous research a possible use of the similarity-based rough set in first-order logic was investigated. Now our focus has changed to representative-based approximation systems. In this article, the authors show a logical system relying on representative-based set approximation. In our approach, a three-valued partial logic system is introduced. Based on the properties of the approximation space, our theorems prove that in some cases, there exists an efficient way to evaluate the first-order formulae

    Ultra-Scalable Spectral Clustering and Ensemble Clustering

    Full text link
    This paper focuses on scalability and robustness of spectral clustering for extremely large-scale datasets with limited resources. Two novel algorithms are proposed, namely, ultra-scalable spectral clustering (U-SPEC) and ultra-scalable ensemble clustering (U-SENC). In U-SPEC, a hybrid representative selection strategy and a fast approximation method for K-nearest representatives are proposed for the construction of a sparse affinity sub-matrix. By interpreting the sparse sub-matrix as a bipartite graph, the transfer cut is then utilized to efficiently partition the graph and obtain the clustering result. In U-SENC, multiple U-SPEC clusterers are further integrated into an ensemble clustering framework to enhance the robustness of U-SPEC while maintaining high efficiency. Based on the ensemble generation via multiple U-SEPC's, a new bipartite graph is constructed between objects and base clusters and then efficiently partitioned to achieve the consensus clustering result. It is noteworthy that both U-SPEC and U-SENC have nearly linear time and space complexity, and are capable of robustly and efficiently partitioning ten-million-level nonlinearly-separable datasets on a PC with 64GB memory. Experiments on various large-scale datasets have demonstrated the scalability and robustness of our algorithms. The MATLAB code and experimental data are available at https://www.researchgate.net/publication/330760669.Comment: To appear in IEEE Transactions on Knowledge and Data Engineering, 201

    Convex Clustering via Optimal Mass Transport

    Full text link
    We consider approximating distributions within the framework of optimal mass transport and specialize to the problem of clustering data sets. Distances between distributions are measured in the Wasserstein metric. The main problem we consider is that of approximating sample distributions by ones with sparse support. This provides a new viewpoint to clustering. We propose different relaxations of a cardinality function which penalizes the size of the support set. We establish that a certain relaxation provides the tightest convex lower approximation to the cardinality penalty. We compare the performance of alternative relaxations on a numerical study on clustering.Comment: 12 pages, 12 figure

    Faster Clustering via Preprocessing

    Full text link
    We examine the efficiency of clustering a set of points, when the encompassing metric space may be preprocessed in advance. In computational problems of this genre, there is a first stage of preprocessing, whose input is a collection of points MM; the next stage receives as input a query set Q⊂MQ\subset M, and should report a clustering of QQ according to some objective, such as 1-median, in which case the answer is a point a∈Ma\in M minimizing ∑q∈QdM(a,q)\sum_{q\in Q} d_M(a,q). We design fast algorithms that approximately solve such problems under standard clustering objectives like pp-center and pp-median, when the metric MM has low doubling dimension. By leveraging the preprocessing stage, our algorithms achieve query time that is near-linear in the query size n=∣Q∣n=|Q|, and is (almost) independent of the total number of points m=∣M∣m=|M|.Comment: 24 page

    Dealing with uncertainty: A rough-set-based approach with the background of classical logic

    Get PDF
    The representative-based approximation has been widely studied in rough set theory. Hence, rough set approximations can be defined by the system of representatives, which plays a crucial role in set approximation. In the authors’ previous research a possible use of the similarity-based rough set in first-order logic was investigated. Now our focus has changed to representative-based approximation systems. In this article, the authors show a logical system relying on representative-based set approximation. In our approach, a three-valued partial logic system is introduced. Based on the properties of the approximation space, our theorems prove that in some cases, there exists an efficient way to evaluate the first-order formulae
    • …
    corecore