Search CORE

106 research outputs found

Non-linear ICA based on Cramer-Wold metric

Author: Jastrzębski Stanisław
Maziarka Łukasz
Nowak Aleksandra
Spurek Przemysław
Tabor Jacek
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2019
Field of study

Non-linear source separation is a challenging open problem with many applications. We extend a recently proposed Adversarial Non-linear ICA (ANICA) model, and introduce Cramer-Wold ICA (CW-ICA). In contrast to ANICA we use a simple, closed--form optimization target instead of a discriminator--based independence measure. Our results show that CW-ICA achieves comparable results to ANICA, while foregoing the need for adversarial training

arXiv.org e-Print Archive

Jagiellonian Univeristy Repository

Plant Metabolomics: A Characterisation of Plant Responses to Abiotic Stresses

Author: Consonni Roberto
Coraggio Immacolata
Genga Annamaria
Locatelli Franca
Mattana Monica
Piffanelli Pietro
Publication venue: 'IntechOpen'
Publication date: 22/09/2011
Field of study

IntechOpen

Classification and Separation of Audio and Music Signals

Author: Al-Shoshan Abdullah I.
Publication venue: 'IntechOpen'
Publication date: 15/12/2020
Field of study

This chapter addresses the topic of classification and separation of audio and music signals. It is a very important and a challenging research area. The importance of classification process of a stream of sounds come up for the sake of building two different libraries: speech library and music library. However, the separation process is needed sometimes in a cocktail-party problem to separate speech from music and remove the undesired one. In this chapter, some existed algorithms for the classification process and the separation process are presented and discussed thoroughly. The classification algorithms will be divided into three categories. The first category includes most of the real time approaches. The second category includes most of the frequency domain approaches. However, the third category introduces some of the approaches in the time-frequency distribution. The approaches of time domain discussed in this chapter are the short-time energy (STE), the zero-crossing rate (ZCR), modified version of the ZCR and the STE with positive derivative, the neural networks, and the roll-off variance. The approaches of the frequency spectrum are specifically the roll-off of the spectrum, the spectral centroid and the variance of the spectral centroid, the spectral flux and the variance of the spectral flux, the cepstral residual, and the delta pitch. The time-frequency domain approaches have not been yet tested thoroughly in the process of classification and separation of audio and music signals. Therefore, the spectrogram and the evolutionary spectrum will be introduced and discussed. In addition, some algorithms for separation and segregation of music and audio signals, like the independent Component Analysis, the pitch cancelation and the artificial neural networks will be introduced

IntechOpen

Crossref

On-line quality control in polymer processing using hyperspectral imaging

Author: Gosselin Ryan
Publication venue
Publication date: 16/04/2018
Field of study

L’industrie du plastique se tourne de plus en plus vers les matériaux composites afin d’économiser de la matière et/ou d’utiliser des matières premières à moindres coûts, tout en conservant de bonnes propriétés. L’impressionnante adaptabilité des matériaux composites provient du fait que le manufacturier peut modifier le choix des matériaux utilisés, la proportion selon laquelle ils sont mélangés, ainsi que la méthode de mise en œuvre utilisée. La principale difficulté associée au développement de ces matériaux est l’hétérogénéité de composition ou de structure, qui entraîne généralement des défaillances mécaniques. La qualité des prototypes est normalement mesurée en laboratoire, à partir de tests destructifs et de méthodes nécessitant la préparation des échantillons. La mesure en-ligne de la qualité permettrait une rétroaction quasi-immédiate sur les conditions d’opération des équipements, en plus d’être directement utilisable pour le contrôle de la qualité dans une situation de production industrielle. L’objectif de la recherche proposée consiste à développer un outil de contrôle de qualité pour la qualité des matériaux plastiques de tout genre. Quelques sondes de type proche infrarouge ou ultrasons existent présentement pour la mesure de la composition en-ligne, mais celles-ci ne fournissent qu’une valeur ponctuelle à chaque acquisition. Ce type de méthode est donc mal adapté pour identifier la distribution des caractéristiques de surface de la pièce (i.e. homogénéité, orientation, dispersion). Afin d’atteindre cet objectif, un système d’imagerie hyperspectrale est proposé. À l’aide de cet appareil, il est possible de balayer la surface de la pièce et d’obtenir une image hyperspectrale, c’est-à-dire une image formée de l’intensité lumineuse à des centaines de longueurs d’onde et ce, pour chaque pixel de l’image. L’application de méthodes chimiométriques permettent ensuite d’extraire les caractéristiques spatiales et spectrales de l’échantillon présentes dans ces images. Finalement, les méthodes de régression multivariée permettent d’établir un modèle liant les caractéristiques identifiées aux propriétés de la pièce. La construction d’un modèle mathématique forme donc l’outil d’analyse en-ligne de la qualité des pièces qui peut également prédire et optimiser les conditions de fabrication.The use of plastic composite materials has been increasing in recent years in order to reduce the amount of material used and/or use more economic materials, all of which without compromising the properties. The impressive adaptability of these composite materials comes from the fact that the manufacturer can choose the raw materials, the proportion in which they are blended as well as the processing conditions. However, these materials tend to suffer from heterogeneous compositions and structures, which lead to mechanical weaknesses. Product quality is generally measured in the laboratory, using destructive tests often requiring extensive sample preparation. On-line quality control would allow near-immediate feedback on the operating conditions and may be transferrable to an industrial production context. The proposed research consists of developing an on-line quality control tool adaptable to plastic materials of all types. A number of infrared and ultrasound probes presently exist for on-line composition estimation, but only provide single-point values at each acquisition. These methods are therefore less adapted for identifying the spatial distribution of a sample’s surface characteristics (e.g. homogeneity, orientation, dispersion). In order to achieve this objective, a hyperspectral imaging system is proposed. Using this tool, it is possible to scan the surface of a sample and obtain a hyperspectral image, that is to say an image in which each pixel captures the light intensity at hundreds of wavelengths. Chemometrics methods can then be applied to this image in order to extract the relevant spatial and spectral features. Finally, multivariate regression methods are used to build a model between these features and the properties of the sample. This mathematical model forms the backbone of an on-line quality assessment tool used to predict and optimize the operating conditions under which the samples are processed

CorpusUL

Advances in independent component analysis and nonnegative matrix factorization

Author: Yuan Zhijian
Publication venue: Teknillinen korkeakoulu
Publication date: 01/01/2009
Field of study

A fundamental problem in machine learning research, as well as in many other disciplines, is finding a suitable representation of multivariate data, i.e. random vectors. For reasons of computational and conceptual simplicity, the representation is often sought as a linear transformation of the original data. In other words, each component of the representation is a linear combination of the original variables. Well-known linear transformation methods include principal component analysis (PCA), factor analysis, and projection pursuit. In this thesis, we consider two popular and widely used techniques: independent component analysis (ICA) and nonnegative matrix factorization (NMF). ICA is a statistical method in which the goal is to find a linear representation of nongaussian data so that the components are statistically independent, or as independent as possible. Such a representation seems to capture the essential structure of the data in many applications, including feature extraction and signal separation. Starting from ICA, several methods of estimating the latent structure in different problem settings are derived and presented in this thesis. FastICA as one of most efficient and popular ICA algorithms has been reviewed and discussed. Its local and global convergence and statistical behavior have been further studied. A nonnegative FastICA algorithm is also given in this thesis. Nonnegative matrix factorization is a recently developed technique for finding parts-based, linear representations of non-negative data. It is a method for dimensionality reduction that respects the nonnegativity of the input data while constructing a low-dimensional approximation. The non-negativity constraints make the representation purely additive (allowing no subtractions), in contrast to many other linear representations such as principal component analysis and independent component analysis. A literature survey of Nonnegative matrix factorization is given in this thesis, and a novel method called Projective Nonnegative matrix factorization (P-NMF) and its applications are provided

Aaltodoc Publication Archive

Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows

Author: Du Chao
Li Tianbo
Lin Min
Pang Tianyu
Yan Shuicheng
Publication venue
Publication date: 25/07/2023
Field of study

Sliced-Wasserstein Flow (SWF) is a promising approach to nonparametric generative modeling but has not been widely adopted due to its suboptimal generative quality and lack of conditional modeling capabilities. In this work, we make two major contributions to bridging this gap. First, based on a pleasant observation that (under certain conditions) the SWF of joint distributions coincides with those of conditional distributions, we propose Conditional Sliced-Wasserstein Flow (CSWF), a simple yet effective extension of SWF that enables nonparametric conditional modeling. Second, we introduce appropriate inductive biases of images into SWF with two techniques inspired by local connectivity and multiscale representation in vision research, which greatly improve the efficiency and quality of modeling images. With all the improvements, we achieve generative performance comparable with many deep parametric generative models on both conditional and unconditional tasks in a purely nonparametric fashion, demonstrating its great potential.Comment: ICML 202

arXiv.org e-Print Archive

Statistical Methods to Enhance Clinical Prediction with High-Dimensional Data and Ordinal Response

Author: Leha Andreas
Publication venue
Publication date: 25/03/2015
Field of study

Der technologische Fortschritt ermöglicht es heute, die moleculare Konfiguration einzelner Zellen oder ganzer Gewebeproben zu untersuchen. Solche in großen Mengen produzierten hochdimensionalen Omics-Daten aus der Molekularbiologie lassen sich zu immer niedrigeren Kosten erzeugen und werden so immer häufiger auch in klinischen Fragestellungen eingesetzt. Personalisierte Diagnose oder auch die Vorhersage eines Behandlungserfolges auf der Basis solcher Hochdurchsatzdaten stellen eine moderne Anwendung von Techniken aus dem maschinellen Lernen dar. In der Praxis werden klinische Parameter, wie etwa der Gesundheitszustand oder die Nebenwirkungen einer Therapie, häufig auf einer ordinalen Skala erhoben (beispielsweise gut, normal, schlecht). Es ist verbreitet, Klassifikationsproblme mit ordinal skaliertem Endpunkt wie generelle Mehrklassenproblme zu behandeln und somit die Information, die in der Ordnung zwischen den Klassen enthalten ist, zu ignorieren. Allerdings kann das Vernachlässigen dieser Information zu einer verminderten Klassifikationsgüte führen oder sogar eine ungünstige ungeordnete Klassifikation erzeugen. Klassische Ansätze, einen ordinal skalierten Endpunkt direkt zu modellieren, wie beispielsweise mit einem kumulativen Linkmodell, lassen sich typischerweise nicht auf hochdimensionale Daten anwenden. Wir präsentieren in dieser Arbeit hierarchical twoing (hi2) als einen Algorithmus für die Klassifikation hochdimensionler Daten in ordinal Skalierte Kategorien. hi2 nutzt die Mächtigkeit der sehr gut verstandenen binären Klassifikation, um auch in ordinale Kategorien zu klassifizieren. Eine Opensource-Implementierung von hi2 ist online verfügbar. In einer Vergleichsstudie zur Klassifikation von echten wie von simulierten Daten mit ordinalem Endpunkt produzieren etablierte Methoden, die speziell für geordnete Kategorien entworfen wurden, nicht generell bessere Ergebnisse als state-of-the-art nicht-ordinale Klassifikatoren. Die Fähigkeit eines Algorithmus, mit hochdimensionalen Daten umzugehen, dominiert die Klassifikationsleisting. Wir zeigen, dass unser Algorithmus hi2 konsistent gute Ergebnisse erzielt und in vielen Fällen besser abschneidet als die anderen Methoden

Georg-August-University Göttingen