4,353 research outputs found

    Unsupervised ensemble minority clustering

    Get PDF
    Cluster a alysis lies at the core of most unsupervised learning tasks. However, the majority of clustering algorithms depend on the all-in assumption, in which all objects belong to some cluster, and perform poorly on minority clustering tasks, in which a small fraction of signal data stands against a majority of noise. The approaches proposed so far for minority clustering are supervised: they require the number and distribution of the foreground and background clusters. In supervised learning and all-in clustering, combination methods have been successfully applied to obtain distribution-free learners, even from the output of weak individual algorithms. In this report, we present a novel ensemble minority clustering algorithm, Ewocs, suitable for weak clustering combination, and provide a theoretical proof of its properties under a loose set of constraints. The validity of the assumptions used in the proof is empirically assessed using a collection of synthetic datasets.Preprin

    Classification of fresh edible oils using a coated piezoelectric sensor array-based electronic nose with soft computing approach for pattern recognition

    Get PDF
    An electronic nose based on an array of six bulk acoustic wave polymer-coated piezoelectric quartz (PZQ) sensors with soft computing-based pattern recognition was used for the classi-fication of edible oils. The electronic nose was presented with 346 samples of fresh edible oil headspace volatiles, generated at 45°C. Extra virgin olive (EVO), nonvirgin olive oil (NVO) and sunflower oil (SFO) were used over a period of 30 days. The sensor responses were visualized by plotting the results from principal component analysis (PCA). Classification of edible oils was carried out using fuzzy c-means as well as radial basis function (RBF) neural networks both from a raw data and data after having been preprocessed by fuzzy c-means. The fuzzy c-means results were poor (74%) due to the different cluster sizes. The result of RBF with fuzzy c-means preprocessing was 95% and 99% for raw data input. RBF networks with fuzzy c-means preprocessing provide the advantage of a simple architecture that is quicker to train.</p
    • …
    corecore