660 research outputs found

    Unlabeled pattern management through Semi-Supervised classification techniques

    Get PDF
    l'obbiettivo di questo progetto consiste nell'analizzare le performance di alcuni algoritmi di semi-supervised learning proposti negli ultimi anni. In particolare si รจ usato un algoritmo di feature selection basato su Self-training per determinare l'insieme ottimo di features per ogni dataset. Poi sono stati applicati alcuni algoritmi di semi-supervised learning per classificare i dati. Questi algoritmi sono stati testati usando rispettivamente come classificatore di base SVM e SMC

    Master of Science

    Get PDF
    thesisPresently, speech recognition is gaining worldwide popularity in applications like Google Voice, speech-to-text reporter (speech-to-text transcription, video captioning, real-time transcriptions), hands-free computing, and video games. Research has been done for several years and many speech recognizers have been built. However, most of the speech recognizers fail to recognize the speech accurately. Consider the well-known application of Google Voice, which aids in users search of the web using voice. Though Google Voice does a good job in transcribing the spoken words, it does not accurately recognize the words spoken with different accents. With the fact that several accents are evolving around the world, it is essential to train the speech recognizer to recognize accented speech. Accent classification is defined as the problem of classifying the accents in a given language. This thesis explores various methods to identify the accents. We introduce a new concept of clustering windows of a speech signal and learn a distance metric using specific distance measure over phonetic strings to classify the accents. A language structure is incorporated to learn this distance metric. We also show how kernel approximation algorithms help in learning a distance metric

    A detection-based pattern recognition framework and its applications

    Get PDF
    The objective of this dissertation is to present a detection-based pattern recognition framework and demonstrate its applications in automatic speech recognition and broadcast news video story segmentation. Inspired by the studies of modern cognitive psychology and real-world pattern recognition systems, a detection-based pattern recognition framework is proposed to provide an alternative solution for some complicated pattern recognition problems. The primitive features are first detected and the task-specific knowledge hierarchy is constructed level by level; then a variety of heterogeneous information sources are combined together and the high-level context is incorporated as additional information at certain stages. A detection-based framework is a รข divide-and-conquerรข design paradigm for pattern recognition problems, which will decompose a conceptually difficult problem into many elementary sub-problems that can be handled directly and reliably. Some information fusion strategies will be employed to integrate the evidence from a lower level to form the evidence at a higher level. Such a fusion procedure continues until reaching the top level. Generally, a detection-based framework has many advantages: (1) more flexibility in both detector design and fusion strategies, as these two parts can be optimized separately; (2) parallel and distributed computational components in primitive feature detection. In such a component-based framework, any primitive component can be replaced by a new one while other components remain unchanged; (3) incremental information integration; (4) high level context information as additional information sources, which can be combined with bottom-up processing at any stage. This dissertation presents the basic principles, criteria, and techniques for detector design and hypothesis verification based on the statistical detection and decision theory. In addition, evidence fusion strategies were investigated in this dissertation. Several novel detection algorithms and evidence fusion methods were proposed and their effectiveness was justified in automatic speech recognition and broadcast news video segmentation system. We believe such a detection-based framework can be employed in more applications in the future.Ph.D.Committee Chair: Lee, Chin-Hui; Committee Member: Clements, Mark; Committee Member: Ghovanloo, Maysam; Committee Member: Romberg, Justin; Committee Member: Yuan, Min

    The Impact of Emotion Focused Features on SVM and MLR Models for Depression Detection

    Get PDF
    Major depressive disorder (MDD) is a common mental health diagnosis with estimates upwards of 25% of the United States population remain undiagnosed. Psychomotor symptoms of MDD impacts speed of control of the vocal tract, glottal source features and the rhythm of speech. Speech enables people to perceive the emotion of the speaker and MDD decreases the mood magnitudes expressed by an individual. This study asks the questions: โ€œif high level features deigned to combine acoustic features related to emotion detection are added to glottal source features and mean response time in support vector machines and multivariate logistic regression models, would that improve the recall of the MDD class?โ€ To answer this question, a literature review goes through common features in MDD detection, especially features related to emotion recognition. Using feature transformation, emotion recognition composite features are produced and added to glottal source features for model evaluation

    ์ปค๋„ ์„œํฌํŠธ์™€ ํ‰ํ˜•์ ์„ ํ™œ์šฉํ•œ ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ ๋‹ค์ค‘ ํด๋ž˜์Šค ๋ถ„๋ฅ˜ ๊ธฐ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์‚ฐ์—…๊ณตํ•™๊ณผ, 2022.2. ์ด์žฌ์šฑ.In this paper, we propose a multi-class classification method using kernel supports and a dynamic system under differential privacy. We find support vector machine (SVM) algorithms have a fundamental weaknesses of implementing differential privacy because the decision function depends on some subset of the training data called the support vectors. Therefore, we develop a method using interior points called equilibrium points (EPs) without relying on the decision boundary. To construct EPs, we utilize a dynamic system with a new differentially private support vector data description (SVDD) by perturbing the sphere center in the kernel space. Empirical results show that the proposed method achieves better performance even on small-sized datasets where differential privacy performs poorly.๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ปค๋„ ์„œํฌํŠธ์™€ ํ‰ํ˜•์ ์„ ํ™œ์šฉํ•œ ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ ๋‹ค์ค‘ ํด๋ž˜์Šค ๋ถ„๋ฅ˜ ๊ธฐ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ์„œํฌํŠธ ๋ฒกํ„ฐ ๋ถ„๋ฅ˜ ๊ธฐ๋ฒ•์€ ๋ฐ์ดํ„ฐ ๋ถ„์„๊ณผ ๋จธ์‹  ๋Ÿฌ๋‹์— ํ™œ์šฉ์„ฑ์ด ๋†’์•„ ์‚ฌ์šฉ์ž์˜ ๋ฐ์ดํ„ฐ๋ฅผ ๋ณดํ˜ธํ•˜๋ฉฐ ํ•™์Šตํ•˜๋Š” ๊ฒƒ์ด ํ•„์ˆ˜์ ์ด๋‹ค. ๊ทธ ์ค‘ ๊ฐ€์žฅ ๋Œ€์ค‘์ ์ธ ์„œํฌํŠธ ๋ฒกํ„ฐ ๋จธ์‹ (SVM)์€ ์„œํฌํŠธ ๋ฒกํ„ฐ๋ผ๊ณ  ๋ถˆ๋ฆฌ๋Š” ์ผ๋ถ€ ๋ฐ์ดํ„ฐ์—๋งŒ ๋ถ„๋ฅ˜์— ์˜์กดํ•˜๊ธฐ ๋•Œ๋ฌธ์— ํ”„๋ผ์ด๋ฒ„์‹œ ์ฐจ๋ถ„ ๊ธฐ๋ฒ•์„ ํ™œ์šฉํ•˜๊ธฐ ์–ด๋ ต๋‹ค. ๋ฐ์ดํ„ฐ ํ•˜๋‚˜๊ฐ€ ๋ณ€๊ฒฝ๋˜์—ˆ์„ ๋•Œ ๊ฒฐ๊ณผ์˜ ๋ณ€ํ™”๊ฐ€ ์ ์–ด์•ผ ํ•˜๋Š” ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ ์ƒํ™ฉ์—์„œ ์„œํฌํŠธ ๋ฒกํ„ฐ ํ•˜๋‚˜๊ฐ€ ์—†์–ด์ง„๋‹ค๋ฉด ๋ถ„๋ฅ˜๊ธฐ์˜ ๊ฒฐ์ • ๊ฒฝ๊ณ„๋Š” ๊ทธ ๋ณ€ํ™”์— ๋งค์šฐ ์ทจ์•ฝํ•˜๋‹ค๋Š” ๋ฌธ์ œ๊ฐ€ ์žˆ๋‹ค. ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํ‰ํ˜•์ ์ด๋ผ๊ณ  ๋ถˆ๋ฆฌ๋Š” ๊ตฐ์ง‘ ๋‚ด๋ถ€์— ์กด์žฌํ•˜๋Š” ์ ์„ ํ™œ์šฉํ•˜๋Š” ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ ๋‹ค์ค‘ ํด๋ž˜์Šค ๋ถ„๋ฅ˜ ๊ธฐ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด, ๋จผ์ € ์ปค๋„ ๊ณต๊ฐ„์—์„œ ๊ตฌ์˜ ์ค‘์‹ฌ์— ์„ญ๋™์„ ๋”ํ•ด ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ๋ฅผ ๋งŒ์กฑํ•˜๋Š” ์„œํฌํŠธ ๋ฒกํ„ฐ ๋ฐ์ดํ„ฐ ๋””์Šคํฌ๋ฆฝ์…˜(SVDD)์„ ๊ตฌํ•˜๊ณ  ์ด๋ฅผ ๋ ˆ๋ฒจ์ง‘ํ•ฉ์œผ๋กœ ํ™œ์šฉํ•ด ๋™์—ญํ•™๊ณ„๋กœ ๊ทน์†Œ์ ๋“ค์„ ๊ตฌํ•œ๋‹ค. ํ‰ํ˜•์ ์„ ํ™œ์šฉํ•˜๊ฑฐ๋‚˜ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ์˜ ๊ฒฝ์šฐ ์ดˆ์ž…๋ฐฉ์ฒด๋ฅผ ๋งŒ๋“ค์–ด, ํ•™์Šตํ•œ ๋ชจ๋ธ์„ ์ถ”๋ก ์— ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋Š” (1) ์„œํฌํŠธ ํ•จ์ˆ˜๋ฅผ ๊ณต๊ฐœ ํ•˜๋Š” ๋ฐฉ๋ฒ•๊ณผ (2) ํ‰ํ˜•์ ์„ ๊ณต๊ฐœํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. 8๊ฐœ์˜ ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ ์ง‘ํ•ฉ์˜ ์‹คํ—˜์ ์ธ ๊ฒฐ๊ณผ๋Š” ์ œ์‹œํ•œ ๋ฐฉ๋ฒ•๋ก ์ด ๋…ธ์ด์ฆˆ์— ๊ฐ•๊ฑดํ•œ ๋‚ด๋ถ€์˜ ์ ์„ ํ™œ์šฉํ•ด ๊ธฐ์กด์˜ ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ ์„œํฌํŠธ ๋ฒกํ„ฐ ๋จธ์‹ ๋ณด๋‹ค ์„ฑ๋Šฅ์„ ๋†’์ด๊ณ , ์ฐจ๋ถ„ ํ”„๋ผ์ด๋ฒ„์‹œ๊ฐ€ ์ ์šฉ๋˜๊ธฐ ์–ด๋ ค์šด ์ž‘์€ ๋ฐ์ดํ„ฐ์…‹์—๋„ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ธฐ์ˆ ์ž„์„ ๋ณด์—ฌ์ค€๋‹ค.Chapter 1 Introduction 1 1.1 Problem Description: Data Privacy 1 1.2 The Privacy of Support Vector Methods 2 1.3 Research Motivation and Contribution 4 1.4 Organization of the Thesis 5 Chapter 2 Literature Review 6 2.1 Differentially private Empirical risk minimization 6 2.2 Differentially private Support vector machine 7 Chapter 3 Preliminaries 9 3.1 Differential privacy 9 Chapter 4 Differential private support vector data description 12 4.1 Support vector data description 12 4.2 Differentially private support vector data description 13 Chapter 5 Differentially private multi-class classification utilizing SVDD 19 5.1 Phase I. Constructing a private support level function 20 5.2 Phase II: Differentially private clustering on the data space via a dynamical system 21 5.3 Phase III: Classifying the decomposed regions under differential privacy 22 Chapter 6 Inference scenarios and releasing the differentially private model 25 6.1 Publishing support function 26 6.2 Releasing equilibrium points 26 6.3 Comparison to previous methods 27 Chapter 7 Experiments 28 7.1 Models and Scenario setting 28 7.2 Datasets 29 7.3 Experimental settings 29 7.4 Empirical results on various datasets under publishing support function 30 7.5 Evaluating robustness under diverse data size 33 7.6 Inference through equilibrium points 33 Chapter 8 Conclusion 34 8.1 Conclusion 34์„
    • โ€ฆ
    corecore