517 research outputs found

    Graph Signal Processing: Overview, Challenges and Applications

    Full text link
    Research in Graph Signal Processing (GSP) aims to develop tools for processing data defined on irregular graph domains. In this paper we first provide an overview of core ideas in GSP and their connection to conventional digital signal processing. We then summarize recent developments in developing basic GSP tools, including methods for sampling, filtering or graph learning. Next, we review progress in several application areas using GSP, including processing and analysis of sensor network data, biological data, and applications to image processing and machine learning. We finish by providing a brief historical perspective to highlight how concepts recently developed in GSP build on top of prior research in other areas.Comment: To appear, Proceedings of the IEE

    Methods and Systems for Fault Diagnosis in Nuclear Power Plants

    Get PDF
    This research mainly deals with fault diagnosis in nuclear power plants (NPP), based on a framework that integrates contributions from fault scope identification, optimal sensor placement, sensor validation, equipment condition monitoring, and diagnostic reasoning based on pattern analysis. The research has a particular focus on applications where data collected from the existing SCADA (supervisory, control, and data acquisition) system is not sufficient for the fault diagnosis system. Specifically, the following methods and systems are developed. A sensor placement model is developed to guide optimal placement of sensors in NPPs. The model includes 1) a method to extract a quantitative fault-sensor incidence matrix for a system; 2) a fault diagnosability criterion based on the degree of singularities of the incidence matrix; and 3) procedures to place additional sensors to meet the diagnosability criterion. Usefulness of the proposed method is demonstrated on a nuclear power plant process control test facility (NPCTF). Experimental results show that three pairs of undiagnosable faults can be effectively distinguished with three additional sensors selected by the proposed model. A wireless sensor network (WSN) is designed and a prototype is implemented on the NPCTF. WSN is an effective tool to collect data for fault diagnosis, especially for systems where additional measurements are needed. The WSN has distributed data processing and information fusion for fault diagnosis. Experimental results on the NPCTF show that the WSN system can be used to diagnose all six fault scenarios considered for the system. A fault diagnosis method based on semi-supervised pattern classification is developed which requires significantly fewer training data than is typically required in existing fault diagnosis models. It is a promising tool for applications in NPPs, where it is usually difficult to obtain training data under fault conditions for a conventional fault diagnosis model. The proposed method has successfully diagnosed nine types of faults physically simulated on the NPCTF. For equipment condition monitoring, a modified S-transform (MST) algorithm is developed by using shaping functions, particularly sigmoid functions, to modify the window width of the existing standard S-transform. The MST can achieve superior time-frequency resolution for applications that involves non-stationary multi-modal signals, where classical methods may fail. Effectiveness of the proposed algorithm is demonstrated using a vibration test system as well as applications to detect a collapsed pipe support in the NPCTF. The experimental results show that by observing changes in time-frequency characteristics of vibration signals, one can effectively detect faults occurred in components of an industrial system. To ensure that a fault diagnosis system does not suffer from erroneous data, a fault detection and isolation (FDI) method based on kernel principal component analysis (KPCA) is extended for sensor validations, where sensor faults are detected and isolated from the reconstruction errors of a KPCA model. The method is validated using measurement data from a physical NPP. The NPCTF is designed and constructed in this research for experimental validations of fault diagnosis methods and systems. Faults can be physically simulated on the NPCTF. In addition, the NPCTF is designed to support systems based on different instrumentation and control technologies such as WSN and distributed control systems. The NPCTF has been successfully utilized to validate the algorithms and WSN system developed in this research. In a real world application, it is seldom the case that one single fault diagnostic scheme can meet all the requirements of a fault diagnostic system in a nuclear power. In fact, the values and performance of the diagnosis system can potentially be enhanced if some of the methods developed in this thesis can be integrated into a suite of diagnostic tools. In such an integrated system, WSN nodes can be used to collect additional data deemed necessary by sensor placement models. These data can be integrated with those from existing SCADA systems for more comprehensive fault diagnosis. An online performance monitoring system monitors the conditions of the equipment and provides key information for the tasks of condition-based maintenance. When a fault is detected, the measured data are subsequently acquired and analyzed by pattern classification models to identify the nature of the fault. By analyzing the symptoms of the fault, root causes of the fault can eventually be identified

    Learning to transform time series with a few examples

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2006.Also issued as printed in pages.MIT Barker Engineering Library copy: printed in pages.Includes bibliographical references (leaves 113-119).I describe a semi-supervised regression algorithm that learns to transform one time series into another time series given examples of the transformation. I apply this algorithm to tracking, where one transforms a time series of observations from sensors to a time series describing the pose of a target. Instead of defining and implementing such transformations for each tracking task separately, I suggest learning a memoryless transformations of time series from a few example input-output mappings. The algorithm searches for a smooth function that fits the training examples and, when applied to the input time series, produces a time series that evolves according to assumed dynamics. The learning procedure is fast and lends itself to a closed-form solution. I relate this algorithm and its unsupervised extension to nonlinear system identification and manifold learning techniques. I demonstrate it on the tasks of tracking RFID tags from signal strength measurements, recovering the pose of rigid objects, deformable bodies, and articulated bodies from video sequences, and tracking a target in a completely uncalibrated network of sensors. For these tasks, this algorithm requires significantly fewer examples compared to fully-supervised regression algorithms or semi-supervised learning algorithms that do not take the dynamics of the output time series into account.by Ali RahimiPh.D

    Localization as a Key Enabler of 6G Wireless Systems: A Comprehensive Survey and an Outlook

    Get PDF
    peer reviewedWhen fully implemented, sixth generation (6G) wireless systems will constitute intelligent wireless networks that enable not only ubiquitous communication but also high-Accuracy localization services. They will be the driving force behind this transformation by introducing a new set of characteristics and service capabilities in which location will coexist with communication while sharing available resources. To that purpose, this survey investigates the envisioned applications and use cases of localization in future 6G wireless systems, while analyzing the impact of the major technology enablers. Afterwards, system models for millimeter wave, terahertz and visible light positioning that take into account both line-of-sight (LOS) and non-LOS channels are presented, while localization key performance indicators are revisited alongside mathematical definitions. Moreover, a detailed review of the state of the art conventional and learning-based localization techniques is conducted. Furthermore, the localization problem is formulated, the wireless system design is considered and the optimization of both is investigated. Finally, insights that arise from the presented analysis are summarized and used to highlight the most important future directions for localization in 6G wireless systems

    ๋ฆฌ๋งŒ ์ตœ์ ํ™”์™€ ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง์— ๊ธฐ๋ฐ˜ํ•œ ์ € ๋žญํฌ ํ–‰๋ ฌ์™„์„ฑ ์•Œ๊ณ ๋ฆฌ๋“ฌ์— ๊ด€ํ•œ ์—ฐ๊ตฌ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€,2020. 2. ์‹ฌ๋ณ‘ํšจ.์ตœ๊ทผ, ์ผ๋ถ€์˜ ๊ด€์ธก์น˜๋กœ๋ถ€ํ„ฐ ํ–‰๋ ฌ์˜ ๋ชจ๋“  ์›์†Œ๋“ค์„ ๋ณต์›ํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ ์ € ๋žญํฌ ํ–‰๋ ฌ ์™„์„ฑ (LRMC)์ด ๋งŽ์€ ์ฃผ๋ชฉ์„ ๋ฐ›๊ณ  ์žˆ๋‹ค. LRMC๋Š” ์ถ”์ฒœ ์‹œ์Šคํ…œ, ์œ„์ƒ ๋ณต์›, ์‚ฌ๋ฌผ ์ธํ„ฐ๋„ท ์ง€์—ญํ™”, ์˜์ƒ ์žก์Œ ์ œ๊ฑฐ, ๋ฐ€๋ฆฌ๋ฏธํ„ฐ ์›จ์ด๋ธŒ ํ†ต ์‹ ๋“ฑ์„ ํฌํ•จํ•œ ๋‹ค์–‘ํ•œ ์‘์šฉ๋ถ„์•ผ์—์„œ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” LRMC์— ๋Œ€ํ•ด ์—ฐ๊ตฌํ•˜์—ฌ LRMC์˜ ๊ฐ€๋Šฅ์„ฑ๊ณผ ํ•œ๊ณ„์— ๋Œ€ํ•œ ๋” ๋‚˜์€ ์ดํ•ด๋ฅผ ํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ธฐ์กด ๊ฒฐ๊ณผ๋“ค์„ ๊ตฌ์กฐ์ ์ด๊ณ  ์ ‘๊ทผ ๊ฐ€๋Šฅํ•œ ๋ฐฉ์‹์œผ๋กœ ๋ถ„๋ฅ˜ํ•œ๋‹ค. ๊ตฌ์ฒด์ ์œผ๋กœ, ์ตœ์‹  LRMC ๊ธฐ๋ฒ•๋“ค์„ ๋‘ ๊ฐ€์ง€ ๋ฒ”์ฃผ๋กœ ๋ถ„๋ฅ˜ํ•œ ๋‹ค์Œ ๊ฐ๊ฐ ์˜๋ฒ”์ฃผ๋ฅผ ๋ถ„์„ํ•œ๋‹ค. ํŠนํžˆ, ํ–‰๋ ฌ์˜ ๊ณ ์œ ํ•œ ์„ฑ์งˆ๊ณผ ๊ฐ™์€ LRMC ๊ธฐ๋ฒ•์„ ์‚ฌ์šฉ ํ• ๋•Œ ๊ณ ๋ คํ•ด์•ผ ํ•  ์‚ฌํ•ญ๋“ค์„ ๋ถ„์„ํ•œ๋‹ค. ๊ธฐ์กด์˜ LRMC ๊ธฐ๋ฒ•์€ ๊ฐ€์šฐ์‹œ์•ˆ ๋žœ ๋คํ–‰๋ ฌ๊ณผ ๊ฐ™์€ ์ผ๋ฐ˜์ ์ธ ์ƒํ™ฉ์—์„œ ์„ฑ๊ณต์ ์ด์—ˆ์œผ๋‚˜ ๋งŽ์€ ์‹ค์ œ ์ƒํ™ฉ์—์„œ ๋Š”๋ณต์›ํ•˜๊ณ ์ž ํ•˜๋Š” ์ € ๋žญํฌ ํ–‰๋ ฌ์ด ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ ๋˜๋Š” ๋‹ค์–‘์ฒด ๊ตฌ์กฐ์™€ ๊ฐ™์€ ๋น„์œ ํด๋ฆฌ๋“œ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์งˆ ์ˆ˜ ์žˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์‹ค์ œ ์‘์šฉ์—์„œ LRMC์˜ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ์ด ๋Ÿฐ์ถ”๊ฐ€์ ์ธ ๊ตฌ์กฐ๊ฐ€ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์ธ๋‹ค. ํŠนํžˆ, ์‚ฌ๋ฌผ ์ธํ„ฐ๋„ท ๋„คํŠธ์›Œ ํฌ์ง€์—ญํ™”๋ฅผ ์œ„ํ•œ ์œ ํด๋ฆฌ๋“œ ๊ฑฐ๋ฆฌ ํ–‰๋ ฌ ์™„์„ฑ ์•Œ๊ณ ๋ฆฌ๋“ฌ์„ ์ œ์•ˆํ•œ๋‹ค. ์œ ํด๋ฆฌ ๋“œ๊ฑฐ๋ฆฌ ํ–‰๋ ฌ์„ ๋‚ฎ์€ ๋žญํฌ๋ฅผ ๊ฐ–๋Š” ์–‘์˜ ์ค€์ •๋ถ€ํ˜ธ ํ–‰๋ ฌ์˜ ํ•จ์ˆ˜๋กœ ํ‘œํ˜„ํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ์–‘์˜ ์ค€์ •๋ถ€ํ˜ธ ํ–‰๋ ฌ๋“ค์˜ ์ง‘ํ•ฉ์€ ๋ฏธ๋ถ„์ด ์ž˜ ์ •์˜๋˜์–ด ์žˆ๋Š” ๋ฆฌ ๋งŒ๋‹ค์–‘์ฒด๋ฅผ ํ˜•์„ฑํ•˜๋ฏ€๋กœ ์œ ํด๋ฆฌ๋“œ ๊ณต๊ฐ„์—์„œ์˜ ์•Œ๊ณ ๋ฆฌ๋“ฌ์„ ์ ๋‹นํžˆ ๋ณ€ํ˜•ํ•˜ ์—ฌLRMC์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค. LRMC๋ฅผ ์œ„ํ•ด ์šฐ๋ฆฌ๋Š” ์ผค๋ ˆ ๊ธฐ์šธ๊ธฐ๋ฅผ ํ™œ์šฉ ํ•œ๋ฆฌ๋งŒ ๋‹ค์–‘์ฒด์—์„œ์˜ ์ง€์—ญํ™” (LRM-CG)๋ผ ๋ถˆ๋ฆฌ๋Š” ๋ณ€๊ฒฝ๋œ ์ผค๋ ˆ ๊ธฐ์šธ๊ธฐ ๊ธฐ ๋ฐ˜์•Œ๊ณ ๋ฆฌ๋“ฌ์„ ์ œ์•ˆํ•œ๋‹ค. ์ œ์•ˆํ•˜๋Š” LRM-CG ์•Œ๊ณ ๋ฆฌ๋“ฌ์€ ๊ด€์ธก๋œ ์Œ ๊ฑฐ๋ฆฌ ๊ฐ€ํŠน์ด๊ฐ’์— ์˜ํ•ด ์˜ค์—ผ๋˜๋Š” ์‹œ๋‚˜๋ฆฌ์˜ค๋กœ ์‰ฝ๊ฒŒ ํ™•์žฅ ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์ธ๋‹ค. ์‹ค์ œ๋กœ ํŠน์ด๊ฐ’์„ ํฌ์†Œ ํ–‰๋ ฌ๋กœ ๋ชจ๋ธ๋ง ํ•œ ๋‹ค์Œ ํŠน์ด๊ฐ’ ํ–‰๋ ฌ์„ ๊ทœ์ œ ํ•ญ์œผ ๋กœLRMC์— ์ถ”๊ฐ€ํ•จ์œผ๋กœ์จ ํŠน์ด๊ฐ’์„ ํšจ๊ณผ์ ์œผ๋กœ ์ œ์–ด ํ•  ์ˆ˜ ์žˆ๋‹ค. ๋ถ„์„์„ ํ†ต ํ•ดLRM-CG ์•Œ๊ณ ๋ฆฌ๋“ฌ์ด ํ™•์žฅ๋œ Wolfe ์กฐ๊ฑด ์•„๋ž˜ ์›๋ž˜ ์œ ํด๋ฆฌ๋“œ ๊ฑฐ๋ฆฌ ํ–‰๋ ฌ ์—์„ ํ˜•์ ์œผ๋กœ ์ˆ˜๋ ดํ•˜๋Š” ๊ฒƒ์„ ๋ณด์ธ๋‹ค. ๋ชจ์˜ ์‹คํ—˜์„ ํ†ตํ•ด LRM-CG์™€ ํ™• ์žฅ๋ฒ„์ „์ด ์œ ํด๋ฆฌ๋“œ ๊ฑฐ๋ฆฌ ํ–‰๋ ฌ์„ ๋ณต๊ตฌํ•˜๋Š” ๋ฐ ํšจ๊ณผ์ ์ž„์„ ๋ณด์ธ๋‹ค. ๋˜ํ•œ, ๊ทธ๋ž˜ํ”„ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์—ฌ ํ‘œํ˜„๋  ์ˆ˜ ์žˆ๋Š” ์ € ๋žญํฌ ํ–‰๋ ฌ ๋ณต์›์„ ์œ„ ํ•œ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง (GNN) ๊ธฐ๋ฐ˜ ๊ธฐ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. ๊ทธ๋ž˜ํ”„ ์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜์˜ LRM C(GNN-LRMC)๋ผ ๋ถˆ๋ฆฌ๋Š” ๊ธฐ๋ฒ•์€ ๋ณต์›ํ•˜๊ณ ์ž ํ•˜๋Š” ํ–‰๋ ฌ์˜ ๊ทธ๋ž˜ํ”„ ์˜ ์—ญํŠน์ง•๋“ค์„ ์ถ”์ถœํ•˜๊ธฐ ์œ„ํ•ด ๋ณ€ํ˜•๋œ ํ•ฉ์„ฑ๊ณฑ ์—ฐ์‚ฐ์„ ์‚ฌ์šฉํ•œ๋‹ค. ์ด๋ ‡๊ฒŒ ์ถ”์ถœ ๋œํŠน์ง•๋“ค์„ GNN์˜ ํ•™์Šต ๊ณผ์ •์— ํ™œ์šฉํ•˜์—ฌ ํ–‰๋ ฌ์˜ ์›์†Œ๋“ค์„ ๋ณต์›ํ•  ์ˆ˜ ์žˆ๋‹ค. ํ•ฉ์„ฑ ๋ฐ ์‹ค์ œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•œ ๋ชจ์˜ ์‹คํ—˜์„ ํ†ตํ•˜์—ฌ ์ œ์•ˆํ•˜๋Š” GNN -LRMC์˜ ์šฐ์ˆ˜ํ•œ ๋ณต๊ตฌ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค.In recent years, low-rank matrix completion (LRMC) has received much attention as a paradigm to recover the unknown entries of a matrix from partial observations. It has a wide range of applications in many areas, including recommendation system, phase retrieval, IoT localization, image denoising, milimeter wave (mmWave) communication, to name just a few. In this dissertation, we present a comprehensive overview of low-rank matrix completion. In order to have better view, insight, and understanding of potentials and limitations of LRMC, we present early scattered results in a structured and accessible way. To be specific, we classify the state-of-the-art LRMC techniques into two main categories and then explain each category in detail. We further discuss issues to be considered, including intrinsic properties required for the matrix recovery, when one would like to use LRMC techniques. However, conventional LRMC techniques have been most successful on a general setting of the low-rank matrix, say, Gaussian random matrix. In many practical situations, the desired low rank matrix might have an underlying non-Euclidean structure, such as graph or manifold structure. In our work, we show that such additional data structures can be exploited to improve the recovery performance of LRMC in real-life applications. In particular, we propose a Euclidean distance matrix completion algorithm for internet of things (IoT) network localization. In our approach, we express the Euclidean distance matrix as a function of the low rank positive semidefinite (PSD) matrix. Since the set of these PSD matrices forms a Riemannian manifold in which the notation of differentiability can be defined, we can recycle, after a proper modification, an algorithm in the Euclidean space. In order to solve the low-rank matrix completion, we propose a modified conjugate gradient algorithm, referred to as localization in Riemannian manifold using conjugate gradient (LRM-CG). We also show that the proposed LRM-CG algorithm can be easily extended to the scenario in which the observed pairwise distances are contaminated by the outliers. In fact, by modeling outliers as a sparse matrix and then adding a regularization term of the outlier matrix into the low-rank matrix completion problem, we can effectively control the outliers. From the convergence analysis, we show that LRM-CG converges linearly to the original Euclidean distance matrix under the extended Wolfes conditions. From the numerical experiments, we demonstrate that LRM-CG as well as its extended version is effective in recovering the Euclidean distance matrix. In order to solve the LRMC problem in which the desired low-rank matrix can be expressed using a graph model, we also propose a graph neural network (GNN) scheme. Our approach, referred to as graph neural network-based low-rank matrix completion (GNN-LRMC), is to use a modified convolution operation to extract the features across the graph domain. The feature data enable the training process of the proposed GNN to reconstruct the unknown entries and also optimize the graph model of the desired low-rank matrix. We demonstrate the reconstruction performance of the proposed GNN-LRMC using synthetic and real-life datasets.Abstract i Contents iii List of Tables vii List of Figures viii 1 Introduction 2 1.1 Motivation 2 1.2 Outline of the dissertation 5 2 Low-Rank Matrix Completion 6 2.1 LRMC Applications 6 2.1.1 Recommendation system 6 2.1.2 Phase retrieval 8 2.1.3 Localization in IoT networks 8 2.1.4 Image compression and restoration 10 2.1.5 Massive multiple-input multiple-output (MIMO) 12 2.1.6 Millimeter wave (mmWave) communication 12 2.2 Intrinsic Properties of LRMC 13 2.2.1 Sparsity of Observed Entries 13 2.2.2 Coherence 18 2.3 Rank Minimization Problem 22 2.4 LRMC Algorithms Without the Rank Information 25 2.4.1 Nuclear Norm Minimization (NNM) 25 2.4.2 Singular Value Thresholding (SVT) 28 2.4.3 Iteratively Reweighted Least Squares (IRLS) Minimization 31 2.5 LRMC Algorithms Using Rank Information 32 2.5.1 Greedy Techniques 34 2.5.2 Alternating Minimization Techniques 37 2.5.3 Optimization over Smooth Riemannian Manifold 39 2.5.4 Truncated NNM 41 2.6 Performance Guarantee 44 2.7 Empirical Performance Evaluation 46 2.8 Choosing the Right Matrix Completion Algorithms 55 3 IoT Localization Via LRMC 56 3.1 Problem Model 57 3.2 Optimization over Riemannian Manifold 61 3.3 Localization in Riemannian Manifold Using Conjugate Gradient (LRMCG) 66 3.4 Computational Complexity 71 3.5 Recovery Condition Analysis 73 3.5.1 Convergence of LRM-CG at Sampled Entries 73 3.5.2 Exact Recovery of Euclidean Distance Matrices 79 3.5.3 Discussion on A3 86 4 Extended LRM-CG for The Outlier Problem 92 4.1 Problem Model 94 4.2 Extended LRM-CG 94 4.3 Numerical Evaluation 97 4.3.1 Simulation Setting 98 4.3.2 Convergence Efficiency 99 4.3.3 Performance Evaluation 99 4.3.4 Outlier Problem 107 4.3.5 Real Data 107 5 LRMC Via Graph Neural Network 112 5.1 Graph Model 116 5.2 Proposed GNN-LRMC 116 5.2.1 Adaptive Model 119 5.2.2 Multilayer GNN 119 5.2.3 Output Model 122 5.2.4 Training Cost Function 123 5.3 Numerical Evaluation 123 6 Conculsion 127 A Proof of Lemma 6 129 B Proof of Theorem 7 131 C Proof of Lemma 8 134 D Proof of Theorem 9 136 E Proof of Lemma 10 140 F Proof of Lemma 12 141 G Proof of Lemma 13 142 H Proof of Lemma 14 144 I Proof of Lemma 15 146 J Proof of Lemma 17 151 K Proof of Lemma 19 154 L Proof of Lemma 20 156 M Proof of Lemma 21 158 Abstract (In Korean) 173 Acknowlegement 175Docto

    Dimensionality reduction and sparse representations in computer vision

    Get PDF
    The proliferation of camera equipped devices, such as netbooks, smartphones and game stations, has led to a significant increase in the production of visual content. This visual information could be used for understanding the environment and offering a natural interface between the users and their surroundings. However, the massive amounts of data and the high computational cost associated with them, encumbers the transfer of sophisticated vision algorithms to real life systems, especially ones that exhibit resource limitations such as restrictions in available memory, processing power and bandwidth. One approach for tackling these issues is to generate compact and descriptive representations of image data by exploiting inherent redundancies. We propose the investigation of dimensionality reduction and sparse representations in order to accomplish this task. In dimensionality reduction, the aim is to reduce the dimensions of the space where image data reside in order to allow resource constrained systems to handle them and, ideally, provide a more insightful description. This goal is achieved by exploiting the inherent redundancies that many classes of images, such as faces under different illumination conditions and objects from different viewpoints, exhibit. We explore the description of natural images by low dimensional non-linear models called image manifolds and investigate the performance of computer vision tasks such as recognition and classification using these low dimensional models. In addition to dimensionality reduction, we study a novel approach in representing images as a sparse linear combination of dictionary examples. We investigate how sparse image representations can be used for a variety of tasks including low level image modeling and higher level semantic information extraction. Using tools from dimensionality reduction and sparse representation, we propose the application of these methods in three hierarchical image layers, namely low-level features, mid-level structures and high-level attributes. Low level features are image descriptors that can be extracted directly from the raw image pixels and include pixel intensities, histograms, and gradients. In the first part of this work, we explore how various techniques in dimensionality reduction, ranging from traditional image compression to the recently proposed Random Projections method, affect the performance of computer vision algorithms such as face detection and face recognition. In addition, we discuss a method that is able to increase the spatial resolution of a single image, without using any training examples, according to the sparse representations framework. In the second part, we explore mid-level structures, including image manifolds and sparse models, produced by abstracting information from low-level features and offer compact modeling of high dimensional data. We propose novel techniques for generating more descriptive image representations and investigate their application in face recognition and object tracking. In the third part of this work, we propose the investigation of a novel framework for representing the semantic contents of images. This framework employs high level semantic attributes that aim to bridge the gap between the visual information of an image and its textual description by utilizing low level features and mid level structures. This innovative paradigm offers revolutionary possibilities including recognizing the category of an object from purely textual information without providing any explicit visual example
    • โ€ฆ
    corecore