25,709 research outputs found

    The Detection of Stress Corrosion Cracking in Natural Gas Pipelines Using Electromagnetic Acoustic Transducers

    Get PDF
    This thesis describes the refinement of a non-destructive, in-line inspection system sensor for the detection of stress corrosion cracks (SCCs) in natural gas pipelines. The sensors are prototype electromagnetic acoustic transducers (EMATs) for noncontact ultrasonic inspection. The focus areas discussed involve the statistically validated performance improvements achieved through the addition of 12 more features, the addition of Principal Component Analysis plus Linear Discriminant Analysis (PCA+LDA) to the classification algorithm, and most significantly the creating of a training set. The training set allowed PCA+LDA to be included in the classification algorithm, as well as allowing one set of no-flaw signature features, one PCA projection matrix, and one LDA projection matrix to be used on multiple pipes and on multiple scanned paths from a pipe. A discrete wavelet decomposition is used to separate the frequency content of each EMAT sample (signature) into five distinct bands. From these decomposed signatures, features are extracted for classification. The classification begins with the projection of the features using the PCA projection matrix derived from the training set, immediately followed by the projection of the PCA projected features using the LDA projection matrix that was also derived from the training set. Finally, the PCA+LDA projected features are classified based on their Mahalanobis distances from the PCA+LDA projected no-flaw training set features. Using the improved feature set and this classification procedure, SCC identification improved 14% and there was an 80% reduction in the number of false positives. In addition, there was a 30% improvement in the detection of the most critical SCCs. SCCs whose average through wall depths were between 35% and 54%

    Weighted LDA techniques for I-vector based speaker verification

    Get PDF
    This paper introduces the Weighted Linear Discriminant Analysis (WLDA) technique, based upon the weighted pairwise Fisher criterion, for the purposes of improving i-vector speaker verification in the presence of high intersession variability. By taking advantage of the speaker discriminative information that is available in the distances between pairs of speakers clustered in the development i-vector space, the WLDA technique is shown to provide an improvement in speaker verification performance over traditional Linear Discriminant Analysis (LDA) approaches. A similar approach is also taken to extend the recently developed Source Normalised LDA (SNLDA) into Weighted SNLDA (WSNLDA) which, similarly, shows an improvement in speaker verification performance in both matched and mismatched enrolment/verification conditions. Based upon the results presented within this paper using the NIST 2008 Speaker Recognition Evaluation dataset, we believe that both WLDA and WSNLDA are viable as replacement techniques to improve the performance of LDA and SNLDA-based i-vector speaker verification

    Do unbalanced data have a negative effect on LDA?

    Get PDF
    For two-class discrimination, Xie and Qiu [The effect of imbalanced data sets on LDA: a theoretical and empirical analysis, Pattern Recognition 40 (2) (2007) 557–562] claimed that, when covariance matrices of the two classes were unequal, a (class) unbalanced data set had a negative effect on the performance of linear discriminant analysis (LDA). Through re-balancing 10 real-world data sets, Xie and Qiu [The effect of imbalanced data sets on LDA: a theoretical and empirical analysis, Pattern Recognition 40 (2) (2007) 557–562] provided empirical evidence to support the claim using AUC (Area Under the receiver operating characteristic Curve) as the performance metric. We suggest that such a claim is vague if not misleading, there is no solid theoretical analysis presented in Xie and Qiu [The effect of imbalanced data sets on LDA: a theoretical and empirical analysis, Pattern Recognition 40 (2) (2007) 557–562], and AUC can lead to a quite different conclusion from that led to by misclassification error rate (ER) on the discrimination performance of LDA for unbalanced data sets. Our empirical and simulation studies suggest that, for LDA, the increase of the median of AUC (and thus the improvement of performance of LDA) from re-balancing is relatively small, while, in contrast, the increase of the median of ER (and thus the decline in performance of LDA) from re-balancing is relatively large. Therefore, from our study, there is no reliable empirical evidence to support the claim that a (class) unbalanced data set has a negative effect on the performance of LDA. In addition, re-balancing affects the performance of LDA for data sets with either equal or unequal covariance matrices, indicating that having unequal covariance matrices is not a key reason for the difference in performance between original and re-balanced data

    Implicitly Constrained Semi-Supervised Linear Discriminant Analysis

    Full text link
    Semi-supervised learning is an important and active topic of research in pattern recognition. For classification using linear discriminant analysis specifically, several semi-supervised variants have been proposed. Using any one of these methods is not guaranteed to outperform the supervised classifier which does not take the additional unlabeled data into account. In this work we compare traditional Expectation Maximization type approaches for semi-supervised linear discriminant analysis with approaches based on intrinsic constraints and propose a new principled approach for semi-supervised linear discriminant analysis, using so-called implicit constraints. We explore the relationships between these methods and consider the question if and in what sense we can expect improvement in performance over the supervised procedure. The constraint based approaches are more robust to misspecification of the model, and may outperform alternatives that make more assumptions on the data, in terms of the log-likelihood of unseen objects.Comment: 6 pages, 3 figures and 3 tables. International Conference on Pattern Recognition (ICPR) 2014, Stockholm, Swede
    corecore